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(57) Abstract 

Streptococcus proteins and polynucleotides encoding them are disclosed. Said proteins arc antigenic and therefore useful vaccine 
components for the prophylaxis or therapy of streptococcus infection in animals. Also disclosed are recombinant methods of producing the 
protein antigens as well as diagnostic assays for detecting streptococcus bacterial infection. 
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NOVEL STREPTOCOCCUS ANTIGENS 
FIELD OF THE INVENTION 

The present invention is related to antigens, more 
particularly protein antigens of streptococcus 
pneumoniaepathogen which are useful as vaccine components 
for therapy and/or prophylaxis. 

BACKGROUND OF THE INVENTION 

S. pneumoniae is an important agent of disease in man 
especially among infants, the elderly and immunocompromised 
persons. It is a bacterium frequently isolated from 
patients with invasive diseases such as 

bacteraemia/ septicaemia, pneumonia, meningitis with high 
morbidity and mortality throughout the world. Even with 
appropriate antibiotic therapy, pneumococcal infections 
still result in many deaths. Although the advent of 
antimicrobial drugs has reduced the overall mortality from 
pneumococcal disease, -the presence of resistant pneumococcal 
organisms has become a major problem in the world today. 
Effective pneumococcal vaccines could have a major impact on 
the morbidity and mortality associated with pneumoniae 
disease. Such vaccines would also potentially be useful to 
prevent otitis media in infants and young children. 

Efforts to develop a pneumococcal vaccine have generally 
concentrated on generating immune responses to the 
pneumococcal capsular polysaccharide. More than 80 
pneumococcal capsular serotypes have been identified on the 
basis of antigenic differences. The currently available 
pneumococcal vaccine, comprising 23 capsular polysaccharides 
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that most frequently caused disease, has significant 
shortcomings related primarily to the poor immunogenicity of 
some capsular polysaccharides, the diversity of the 
serotypes and the differences in the distribution of 
5 serotypes over time, geographic areas and age groups. In 
particular, the failure of existing vaccines and capsular 
conjugate vaccines currently in development to protect young 
children against all serotypes spurres evaluation of other 
S . pneumoniae components. Although immunogenicity of 

10 capsular polysaccharides can be improved, serotype 

specificity will still represent a major limitation of 
polysaccharide-based vaccines. The use of a antigenically 
conserved immunogenic pneumococcal protein antigen, either 
by itself or in combination with additional components, 

15 offers the possibility of a protein-based pneumococcal 
vaccine. 

PCT Publication number WO98/18930 published may 7 1998 
entitled "Streptococcus Pneumoniae antigens and vaccines" 
20 describes certain polypeptides which are claimed to be 
antigenic. However, no biological activity of these 
polypeptides is reported. 

Therefore their remains an unmet need for Streptococcus 
25 antigens that may be used as vaccine components for the 
prophylaxis and/or therapy of Streptococcus infection. 

SUMMARY OF THE INVENTION 

30 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 6, 8 # 10, 14 , 16, 55 

2 
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to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof . 

In other aspects, there are provided vectors comprising 
5 polynucleotides of the invention operably linked to an 
expression control region, as well as host cells 
transfected with said vectors and methods of producing 
polypeptides comprising culturing said host cells under 
conditions suitable for expression. 
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In yet another aspect, there are provided novel 
polypeptides encoded by polynucleotides of the invention. 



15 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is the DNA sequence of BVH-3 gene; SEQ ID NO: 1. 

Figure 2 is the amino acid sequence of BVH-3 protein; SEQ ID 
20 NOs 2. 

Figure 3 is the DNA sequence of BVH-11 gene; SEQ ID NO: 3. 

Figure 4 is the amino acid sequence of BVH-11 protein; SEQ 
25 ID NOs 4. 

Figure 5 is the DNA sequence of BVH-28 gene; SEQ ID NOs 5. 

Figure 6 is the amino acid sequence of BVH-28 protein; SEQ 
30 ID NOs 6. 

Figure 7 is the DNA sequence of BVH-3 A gene which 

corresponds to the 5' terminal end of BVH-3; SEQ ID NO: 7. 
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Figure 8 is the amino acid sequence of BVH-3A protein; SEQ 
ID NO: 8. 

Figure 9 is the DNA sequence of BVH-3B gene which 
5 corresponds to the 3' terminal end of BVH-3; SEQ ID NO: 9. 

Figure 10 is the amino acid sequence of BVH-3B protein; SEQ 
ID NO: 10. 

10 

Figure 11 depicts the comparison of the predicted amino 
acid sequences of the BVH-3 open reading frames from WU2, 
RX1, JNR.7/87, SP64, P4241 and A66 pneumoniae strains 
by using the program Clustal W from MacVector sequence 
15 analysis software (version 6.5). Underneath the alignment, 
there is a consensus line where * and . characters indicate 
identical and similar amino acid residues, respectively. 

Figure 12 depicts the comparison of the predicted amino 
20 acid sequences of the BVH-11 open reading frames from WU2, 
Rxl, JNR.7/87, SP64, P4241, A66 and SP63 pneumoniae 
strains by using the program Clustal W from MacVector 
sequence analysis software (version 6.5). Underneath the 
alignment, there is a consensus line where * and . 
25 characters indicate identical and similar amino acid 
residues , respectively . 

Figure 13 depicts the comparison of the predicted amino 
acid sequences of the BVH-11 proteins from various 
30 pneumoniae strains. The degrees of identity (I) and 

similarity (S) were determined by using the program Clustal 
W from MacVector sequence analysis software (version 6.5). 

Figure 14 is a DNA sequence containing the complete BVH-3 
35 gene (open reading frame "ORF" at nucleotides 1777 to 
4896) ; SEQ ID NO: 11. 

4 
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Figure 15 is a DNA sequence containing the complete BVH-11 
gene <ORF at nucleotides 45 to 2567); SEQ ID NO* 12. 

5 Figure 16 is a DNA sequence containing the complete BVH-11- 
2 gene (ORF at nucleotides 114 to 2630); SEQ ID NO: 13. 

Figure 17 is the amino acid sequence of BVH-11-2 protein; 
SEQ ID NO: 14. 

10 

Figure 18 is the DNA sequence of SP63 BVH-3 gene; SEQ ID 
NO: 15. 

Figure 19 is the amino acid sequence of SP63 BVH-3 protein; 
15 SEQ ID NO: 16. 

Figure 20 is the amino acid sequence of BVH-3M protein; SEQ 
ID NO: 55. 

20 Figure 21 is the amino acid sequence of BVH-3 AD protein; 
SEQ ID NO: 56. 

Figure 22 is the amino acid sequence of L-BVH-3-AD protein; 
SEQ ID NO: 57. 

25 

Figure 23 is the amino acid sequence of NEW 12 protein; SEQ 
ID NO: 58. 

Figure 24 is the amino acid sequence of BVH-3C protein; SEQ 
30 ID NO: 59. 

Figure 25 is the amino acid sequence of BVH-11M protein; 
SEQ ID NO: 60. 

35 Figure 26 is the amino acid sequence of BVH-11A protein; 
SEQ ID NO: 61. 
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Figure 27 is the amino acid sequence of BVH-11B (also 
called Newl3) protein; SEQ ID NO: 62. 

5 Figure 28 is the amino acid sequence of BVH-11C protein; 
SEQ ID NO: 63. 

Figure 29 is the amino acid sequence of NEW1 protein; SEQ 
ID NO: 64. 

10 

Figure 30 is the amino acid sequence of NEW2 protein; SEQ 
ID NO: 65. 

Figure 31 is the amino acid sequence of NEW3 protein; SEQ 
15 ID NO: 66. 

Figure 32 is the amino acid sequence of NEW4 protein; SEQ 
ID NO: 67. 

20 Figure 33 is the amino acid sequence of NEW5 protein; SEQ 
ID NO: 68. 

Figure 34 is the amino acid sequence of NEW6 protein; SEQ 
ID NO: 69. 

25 

Figure 35 is the amino acid sequence of NEW7 protein; SEQ 
ID NO: 70. 

Figure 36 is the amino acid sequence of NEW8 protein; SEQ 
30 ID NO: 71. 

Figure 37 is the amino acid sequence of NEW9 protein; SEQ 
ID NO: 72. 

35 Figure 38 is the amino acid sequence of BVH-11-2M protein; 
SEQ ID NO: 73. 

6 
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Figure 39 is the amino acid sequence of NEW10 protein; SEQ 
ID NO: 74. 



5 Figure 40 is the amino acid sequence of NEW11 protein; SEQ 
ID NO: 75. 

Figure 41 is the DNA sequence of NEW12 gene; SEQ ID NO: 76. 

10 Figure 42 is the amino acid sequence of NEW14 protein; SEQ 
ID NO: 77. 



Figure 43 is the amino acid 
ID NO: 78. 

Figure 44 is the amino acid 
ID NO: 79. 



sequence of NEW15 protein; SEQ 



sequence of NEW16 protein; SEQ 



Figure 45 is the DNA sequence of GBS BVH-71 gene; SEQ ID 
20 NO: 80. 



Figure 46 is the amino acid sequence of GBS BVH-71 

protein; SEQ ID NO: 81. 

25 - Figure 47 is the DNA sequence of GAS BVH-71 gene; SEQ ID 
NO: 82. 



Figure 48 is the amino acid sequence of GAS BVH-71 protein; 
SEQ ID NO: 83. 

30 

DETAILED DESCRIPTION OF THE INVENTION 

i 

According to one aspect, the present invention provides an 
35 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 

7 
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to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof . 

According to one aspect, the present invention provides an 
5 isolated polynucleotide encoding a polypeptide having at 
least 95% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 

10 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 
15 75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
20 least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 
75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof. 

25 According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 
75, 77 to 79 or fragments, analogs or derivatives thereof. 

30 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 8, 10, 16, 55, 56, 57, 
35 58, 59, 64, 65, 66, 78 or fragments, analogs or derivatives 
thereof . 

8 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
5 sequence chosen from SEQ ID NOs: 2, 8, 10, 16, 55, 56, 57, 
59, 64, 65, 66, 78 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
10 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 4, 14, 58, 60, 61, 62, 63, 
67, 68, 69, 70, 71, 72, 73, 74, 75, 77, 79 or fragments, 
analogs or derivatives thereof. 

15 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 4, 14, 60, 61, 62, 63, 67, 
20 68, 69, 70, 71, 72, 73, 74, 75, 77, 79 or fragments, 
analogs or derivatives thereof . 

According to one aspect, the present invention provides an 
25 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 10, 14, 16, 55 to 
75, 77 to 79 or fragments, analogs or derivatives thereof. 

30 According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
leasts 7 0% identity to a second polypeptide comprising 
sequence chosen from SEQ ID NOs: 10, 55 to 75, 77, 78, 79 
or fragments, analogs or derivatives thereof. 

35 

According to one aspect, the present invention provides an 

9 
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isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence chosen from SEQ ID NOs: 55 to 75, 77, 78, 79 or 
fragments, analogs or derivatives thereof. 

5 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 6, 8, 10 or 
10 fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
15 least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 10, 14, 16 or 
fragments, analogs or derivatives thereof. 

According to one aspect, the present invention provides an 
20 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising a 
sequence chosen from SEQ ID NOs: 2, 4, 14, 16 or fragments, 
analogs or derivatives thereof . 

25 According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 2 or fragments, analogs or derivatives 
thereof . 

30 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 4 or fragments, analogs or derivatives 
35 thereof . 

10 



BNSOOCID <WO 0039299 A2_l_> 



WO 00/39299 



PCT/CA99/01218 



According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 10 or fragments, analogs or derivatives 
5 thereof. 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
10 sequence SEQ ID NO: 14 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
15 least 70% identity to a second polypeptide comprising 

sequence SEQ ID NO: 16 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
20 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 58 or fragments, analogs or derivatives 
thereof . 

25 According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 60 or fragments, analogs or derivatives 
thereof . 

30 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 62 or fragments, analogs or derivatives 
35 thereof . 

11 



BNSDOCIO: <WO O039299A2_l_> 



WO 00/39299 PCT/CA99/0121 8 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 64 or fragments, analogs or derivatives 
5 thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
10 sequence SEQ ID NO: 67 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
15 least 70% identity to a second polypeptide comprising 

sequence SEQ ID NO: 68 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention provides an 
20 isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 69 or fragments, analogs or derivatives 
thereof . 

25 According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 72 or fragments, analogs or derivatives 
thereof . 

30 

According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 74 or fragments, analogs or derivatives 
35 thereof . 

12 
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According to one aspect, the present invention provides an 
isolated polynucleotide encoding a polypeptide having at 
least 70% identity to a second polypeptide comprising 
sequence SEQ ID NO: 77 or fragments, analogs or derivatives 
5 thereof . 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 6, 8, 10 or fragments, 
10 analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 
15 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
20 chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 75, 77 
to 79, 81, 83 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates, to 
polypeptides characterized by the amino acid sequence 
25 chosen from SEQ ID NOs: 2, 4 , 10, 14, 16, 55 to 75, 77 to 
79, 81, 83 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
30 chosen from SEQ ID NOs: 2, 4, 8, 10, 14, 16, 55 to 75, 77 
to 79 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
35 chosen from SEQ ID NOe: 2, 4, 10, 14, 16, 55 to 75, 77 to 
79 or fragments, analogs or derivatives thereof. 

13 
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According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 2, 4, 10, 14, 16 or fragments, 
5 analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 2 or fragments, analogs or 
10 derivatives thereof . 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 4 or fragments, analogs or 
15 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 10 or fragments, analogs or 
20 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 14 or fragments, analogs or 
25 derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 16 or fragments, analogs or 
30 derivatives thereof . 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NOs: 10, 55 to 75, 77, 78, 79 or 
35 fragments, analogs or derivatives thereof. 
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According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 
74, 77 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 
74, 77 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 58, 60, 62, 64, 67, 68, 69, 72, 
74, 77 or fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
chosen from SEQ ID NO: 10, 62, 64, 67, 68, 74, 77 or 
fragments, analogs or derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 58 or fragments, analogs or 
derivatives thereof. 

According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 62 or fragments, analogs or 
derivatives thereof. 



According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
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polypeptides characterized by the amino acid sequence 
comprising sequence SEQ id NO: 67 or fragments, analogs or 
derivatives thereof . 

5 According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 68 or fragments, analogs or 
derivatives thereof. 

10 According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 74 or fragments, analogs or 
derivatives thereof . 

15 According to one aspect, the present invention relates to 
polypeptides characterized by the amino acid sequence 
comprising sequence SEQ ID NO: 77 or fragments, analogs or 
derivatives thereof. 

20 In a further embodiment, the present invention also relates 
to chimeric polypeptides which comprise one or more 
polypeptides or fragments, analogs or derivatives thereof 
as described in the present application. 

25 In a further embodiment, the present invention also relates 
to chimeric polypeptides which comprise one or more 
polypeptides or fragments, analogs or derivatives thereof 
as defined in the figures of the present application. 

30 In a further embodiment, the present application also 

relates to chimeric polypeptides which comprise two or more 
polypeptides chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 
16, 55 to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof ; provided that the polypeptides or 

35 fragments, analogs or derivatives thereof are linked as to 
form a chimeric polypeptide. 
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In a further embodiment, the chimeric polypeptide will 
comprise two or more polypeptides chosen from SEQ ID 
®0& slO, 58, 60, 62, 64, 67, 68, 69, 72, 74, 77 or 
5 fragments, analogs or derivatives thereof; provided that 
the polypeptides or fragments, analogs or derivatives 
thereof are linked as to form a chimeric polypeptide. 

In a further embodiment, the chimeric polypeptide will 
10 comprise two or more polypeptides chosen from SEQ ID 
S90s slO, 58, 60, 62, 64, 67, 68, 74, 77 or fragments, 
analogs or derivatives thereof; provided that the 
polypeptides or fragments, analogs or derivatives thereof 
are linked as to form a chimeric polypeptide. 

15 

In a further embodiment, the chimeric polypeptide will 
comprise two or more polypeptides chosen from SEQ ID 
NOs slO, 62, 64, 67, 68, 74, 77 or fragments, analogs or 
derivatives thereof; provided that the polypeptides or. 
20 fragments, analogs or derivatives thereof are linked as to 
form a chimeric polypeptide. 

In a further embodiment, the chimeric polypeptide will 
comprise between 2 and 5 polypeptides. 

25 

In a further embodiment, the chimeric polypeptide will 
comprise between 2 and 4 polypeptides. 

In a further embodiment, the chimeric polypeptide will 
30 comprise between 2 and 3 polypeptides. 

In a further embodiment, the chimeric polypeptide will 
comprise 2 polypeptides. 

35 
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In a further embodiment, there is provided a chimeric 
polypeptide of formula (I) : 
A-(B) m -<C) n -D (I) 

5 Wherein; 

m is 0 or 1, 
a is 0 or 1, 

A is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 

75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
10 thereof; 

B is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 

75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof ; 

C is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 

15 75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof ; and 

D is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 

75, 77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

20 

In a further embodiment, 

A is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; 

25 B is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77, or fragments, analogs or derivatives 
thereof; 

C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
30 thereof; and 

D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
69, 72, 74, 77 or fragments, analogs or derivatives 
thereof . 

35 In a further embodiment, 
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A is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

74, 77 or fragments, analogs or derivatives thereof; 
B is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
74, 77, or fragments, analogs or derivatives thereof; 
5 C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
74, 77 or fragments, analogs or derivatives thereof; and 
D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 
74, 77 or fragments, analogs or derivatives thereof. 

10 In one embodiment, chimeric polypeptides of the present 

invention comprise those wherein the following embodiments 
are present, either independently or in combination. 

In a further embodiment, A is SEQ ID NOs :10, 58, 62, 64, 
15 67, 68, 74, 77 or fragments, analogs or derivatives 
thereof . 

In a further embodiment, A is SEQ ID NO slO or fragments, 
analogs or derivatives thereof . 

In a further embodiment, A is SEQ ID NO :58 or fragments, 
20 analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :62 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof . 
25 In a further embodiment, A is SEQ ID NO :67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :68 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :74 or fragments, 
30 analogs or derivatives thereof. 

In a further embodiment, A is SEQ ID NO :77 or fragments, 
analogs or derivatives thereof. 
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In a further embodiment, B is SEQ ID NOs :10, 58, 62, 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof . 

In a further embodiment, B is SEQ ID NO :10 or fragments, 
5 analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof . 

In a further embodiment, B is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 
10 In a further embodiment, B is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :68 or fragments, 
15 analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO :74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, B is SEQ ID NO : 77 or fragments, 
analogs or derivatives thereof . 

20 

In a further embodiment, C is SEQ ID NOs :10, 58, 62, 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof . 

In a further embodiment, C is SEQ ID NO :10 or fragments, 
25 analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 62 or fragments, 
analogs or derivatives thereof. 
30 In a further embodiment, C is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 67 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, C is SEQ ID NO : 68 or fragments, 
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analogs or derivatives thereof . 

In a further embodiment, C is SEQ ID NO : 74 or fragments, 
analogs or derivatives thereof . 

In a further embodiment, C is SEQ ID NO : 77 or fragments, 
5 analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :10, 58, 62, 64, 
67, 68, 74, 77 or fragments, analogs or derivatives 
thereof. 

10 In a further embodiment, D is SEQ ID NO :10 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :58 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :62 or fragments, 
15 . analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :64 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :67 or fragments, 
analogs or derivatives thereof. 
20 In a further embodiment, D is SEQ ID NO :68 or fragments, 

analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :74 or fragments, 
analogs or derivatives thereof. 

In a further embodiment, D is SEQ ID NO :77 or fragments, 
25 analogs or derivatives thereof. 

In a further embodiment, m is 0. 
In a further embodiment, n is 0. 

30 

In a 'further embodiment, m and n are 0. 

In a further embodiment, m and n are 0 , A is SEQ ID NO: 64 
or fragments, analogs or derivatives thereof, B is SEQ ID 
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NO: 62 or fragments, analogs or derivatives thereof. 
In a further embodiment, m and n are 0, A is SEQ ID NO: 62 
or fragments, analogs or derivatives thereof, B is SEQ ID 
NO: 64 or fragments, analogs or derivatives thereof. 

5 

In accordance with the present invention, all nucleotides 
encoding polypeptides and chimeric polypeptides are within 
the scope of the present invention. 

10 In a further embodiment, the polypeptides or chimeric 

polypeptides in accordance with the present invention are 
antigenic . 

In a further embodiment, the polypeptides or chimeric 
15 polypeptides in accordance with the present invention can 
elicit an immune response in an individual. 

In a further embodiment, the present invention also relates 
to polypeptides which are able to raise antibodies having 
20 binding specificity to the polypeptides or chimeric 

polypeptides of the present invention as defined above. 

An antibody that " has binding specif icity n is an antibody 
that recognizes and binds the selected polypeptide but 
25 which does not substantially recognize and bind other 
molecules in a sample, e.g., a biological sample, which 
naturally includes the selected peptide. Specific binding 
can be measured using an ELISA assay in which the selected 
polypeptide is used as an antigen. 

30 

Unless otherwise defined, all technical and scientific 
terms used herein have the same meaning as commonly 
understood by one of ordinary skill in the art to which 
this invention belongs. All publications, patent 
35 applications, patents, and other references mentioned 

herein are incorporated by reference in their entirety. In 
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case of conflict, the present specification, including 
definitions, will control. In addition, the materials, 
methods, and examples are illustrative only and not 
intended to be limiting. 

5 

As used herein, -fragments", "derivatives" or "analogs" of 
the polypeptides of the invention include those 
polypeptides in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved 

10 amino acid residue (preferably conserved) and which may be 
natural or unnatural. In one embodiment, derivatives and 
analogs of polypeptides of the invention will have about 
70% identity with those sequences illustrated in the 
figures or fragments thereof. That is, 70% of the residues 

15 are the same. In a further embodiment, polypeptides will 
have greater than 75% homology. In a further embodiment, 
polypeptides will have greater than 80% homology. In a 
further embodiment, polypeptides will have greater than 85% 
homology. In a further embodiment, polypeptides will have 

20 greater than 90% homology. In a further embodiment, 

polypeptides will have greater than 95% homology. In a 
further embodiment, polypeptides will have greater than 99% 
homology. In a further embodiment, derivatives and analogs 
of polypeptides of the invention will have fewer than about 

25 20 amino acid residue substitutions, modifications or 
deletions and more preferably less than 10. Preferred 
substitutions are those known in the art as conserved i.e. 
the substituted residues share physical or chemical 
properties such as hydrophobicity, size, charge or 

30 functional groups . 

In accordance with the present invention, polypeptides of 
the invention include both polypeptides and chimeric 
polypeptides . 

35 

Also included are polypeptides which have fused thereto 
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other compounds which alter the polypeptides biological or 
pharmacological properties i.e. polyethylene glycol (PEG) 
to increase half-life; leader or secretory amino acid 
sequences for ease of purification; prepro- and pro- 
5 sequences; and (poly) saccharides . 

Furthermore, in those situations where amino acid regions 
are found to be polymorphic, it may be desirable to vary 
one or more particular amino acids to more effectively 
10 mimic the different epitopes of the different streptococcus 
strains . 

Moreover, the polypeptides of the present invention can be 
modified by terminal -NH 2 acylation (eg. by acetylation, or 
15 thioglycolic acid amidation, terminal carbosy amidation, 
e.g. with ammonia or methylamine) to provide stability, 
increased hydrophobicity for linking or binding to a 
support or other molecule. 

20 Also contemplated are hetero and homo polypeptide multimers 
of the polypeptide fragments, analogues and derivatives. 
These polymeric forms include, for example, one or more 
polypeptides that have been cross- linked with cross-linkers 
such as avidin/biotin, gluteraldehyde or dimethyl- 

25 superimidate . Such polymeric forms also include 

polypeptides containing two or more tandem or inverted 
contiguous sequences, produced from multicistronic mKNAs 
generated by recombinant DNA technology. 
Preferably, a fragment, analog or derivative of a 

30 polypeptide of the invention will comprise at least one 
antigenic region i.e. at least one epitope. 

In order to achieve the formation of antigenic polymers 
(i.e. synthetic multimers), polypeptides may be utilized 
35 having bishaloacetyl groups, nitroarylhalides, or the like, 
where the reagents being specific for thio groups. 
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Therefore, the link between two mercapto groups of the 
different peptides may be a single bond or may be composed 
of a linking group of at least two, typically at least 
four, and not more than 16, but usually not more than about 
5 14 carbon atoms . 

In a particular embodiment, polypeptide fragments, analogs 
and derivatives of the invention do not contain a 
methionine (Met) starting residue. Preferably, 

10 polypeptides will not incorporate a leader or secretory 
sequence (signal sequence) . The signal portion of a 
polypeptide of the invention may be determined according to 
established molecular biological techniques. In general, 
the polypeptide of interest may be isolated from a 

15 streptococcus culture and subsequently sequenced to 

determine the initial residue of the mature protein and 
therefore the sequence of the mature polypeptide. 

According to another aspect, there are provided vaccine 
20 compositions comprising one or more streptococcus 
polypeptides of the invention in admixture with a 
pharmaceutical^ acceptable carrier diluent or adjuvant. 
Suitable adjuvants include oils i.e. Freund's complete or 
incomplete adjuvant; salts i.e. AlK(S0 4 ) 2 , AlNa(S0 4 ) 2 , 
25 AlNHJSOJj, silica, kaolin, carbon polynucleotides i.e. 

poly IC and poly AU. Preferred adjuvants include QuilA and 
Alhydrogel. Vaccines of the invention may be administered 
parenterally by injection, rapid infusion, nasopharyngeal 
absorption, dermoabsorption, or bucal or oral. 
30 Pharmaceutically acceptable carriers also include tetanus 
toxoid. 

Vaccine compositions of the invention are used for the 
treatment or prophylaxis of streptococcus infection and/or 
35 diseases and symptoms mediated by streptococcus infection 
as described in P.R. Murray (Ed, in chief), E.J. Baron, M.A. 
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Pfaller, F.C. Tenover and R.H. Yolken. Manual of Clinical 
Microbiology, ASM Press, Washington, D.C. sixth edition, 
1995, 1482p which are herein incorporated by reference. In 
one embodiment, vaccine compositions of the present 
5 invention are used for the treatment or prophylaxis of 

meningitis, otitis media, bacteremia or pneumonia. In one 
embodiment, vaccine compositions of the invention are used 
for the treatment or prophylaxis of streptococcus infection 
and/or diseases and symptoms mediated by streptococcus 
10 infection, in particular S .pneumoniae , group A 

streptococcus (pyogenes) , group B streptococcus (GBS or 
agalactiae) , dysgalactiae, uteris, nocardia as well as 
Staphylococcus aureus. In a further embodiment, the 
streptococcus infection is S .pneumoniae . 

15 

In a particular embodiment, vaccines are administered to 
those individuals at risk of streptococcus infection such 
as infants, elderly and immunocompromised individuals. 

20 As used in the present application, the term n individuals" 
include mammals. In a further embodiment, the mammal is 
human. 

Vaccine compositions are preferably in unit dosage form of 
25 about 0.001 to 100 ug/kg (antigen/body weight) and more 
preferably 0.01 to 10 Ug/kg and most preferably 0.1 to 1 
ug/kg 1 to 3 times with an interval of about 1 to 6 week 
intervals between immunizations. 

30 According to another aspect, there are provided 

polynucleotides encoding polypeptides characterized by the 
amino' acid sequence chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 
14, 16, 55 to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof. 

35 
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In one embodiment, polynucleotides are those illustrated in 
SEQ ID Nos: 1, 3, 5, 7, 9, 11, 12, 13, 15, 76, 80, 82 

which may include the open reading frames (ORF) , encoding 
polypeptides of the invention. It will be appreciated that 
5 the polynucleotide sequences illustrated in the figures may 
be altered with degenerate codons yet still encode the 
polypeptides of the invention. Accordingly the present 
invention further provides polynucleotides which hybridize 
to the polynucleotide sequences herein above described (or 

10 the complement sequences thereof) having 50% identity 

between sequences. In one embodiment, at least 70% identity 
between sequences. In one embodiment, at least 75% identity 
between sequences. In one embodiment, at least 80% identity 
between sequences. In one embodiment, at least 85% identity 

15 between sequences. In one embodiment, at least 90% identity 
between sequences. In a further embodiment, polynucleotides 
are hybridizable under stringent conditions i.e. having at 
least 95% identity. In a further embodiment, more than 97% 
identity. 

20 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 1, 3, 7, 9, 11, 12, 13, 15, 76, 
80, 82 encoding polypeptides of the invention. 

25 In a further embodiment, polynucleotides are those 

illustrated in SEQ ID NOs ; 1, 3, 9, 11, 12, 13, 15, 76, 
80, 82 which may include the open reading frames (ORF) , 
encoding polypeptides of the invention. 

30 In a further embodiment, polynucleotides are those 
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illustrated in SEQ ID NOs t 1, 3, 7, 9, 11, 12, 13, 15, 76 

which may include the open reading frames (ORF) , encoding 
polypeptides of the invention. 

5 In a further embodiment, polynucleotides are those 

illustrated in SEQ ID NOs : 1, 7, 9, 11, 15, 76 which may 
include the open reading frames (ORF) , encoding 
polypeptides of the invention. 

10 In a .further embodiment, polynucleotides are those 

illustrated in SEQ ID NOs : 1, 9, 11, 15, 76 which may 
include the open reading frames (ORF) , encoding 
polypeptides of the invention. 

15 In a further embodiment, polynucleotides are those 

illustrated in SEQ ID NOs : 1, 7, 9, 11 which may include 
the open reading frames (ORF) , encoding polypeptides of the 
invention. 

20 In a further embodiment, polynucleotides are those 

illustrated in SEQ ID NO : 1, encoding polypeptides of the 
invention. 

In a further embodiment, polynucleotides are those 
25 illustrated in SEQ ID NO :7, encoding polypeptides of the 
invention . 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO :9, encoding polypeptides of the 
30 invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO :11, encoding polypeptides of the 
invention. 

35 
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In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO sl5, encoding polypeptides of the 
invention. 

5 In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NOs : 3, 12, 13, 76, encoding 
polypeptides of the invention. 

In a further embodiment, polynucleotides are those 
10 illustrated in SEQ ID NO :3, encoding polypeptides of the 
invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO :12, encoding polypeptides of the 
15 invention. 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO sl3, encoding polypeptides of the 
invention. 

20 

In a further embodiment, polynucleotides are those 
illustrated in SEQ ID NO :76, encoding polypeptides of the 
invention. 

25 As will be readily appreciated by one skilled in the art, 
polynucleotides include both DNA and RNA. 

The present invention also includes polynucleotides 
complementary to the polynucleotides described in the 
30 present application. 

In a further aspect, polynucleotides encoding polypeptides 
of the invention, or fragments, analogs or derivatives 
thereof, may be used in a DNA immunization method. That 
35 is, they can be incorporated into a vector which is 
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replicable and expressible upon injection thereby producing 
the antigenic polypeptide in vivo. For example 
polynucleotides may be incorporated into a plasmid vector 
under the control of the CMV promoter which is functional 
5 in eukaryotic cells. Preferably the vector is injected 
intramuscularly . 

According to another aspect, there is provided a process 
for producing polypeptides of the invention by recombinant 

10 . techniques by expressing a polynucleotide encoding said 
polypeptide in a host cell and recovering the expressed 
polypeptide product. Alternatively, the polypeptides can 
be produced according to established synthetic chemical 
techniques i.e. solution phase or solid phase synthesis of 

15 oligopeptides which are ligated to produce the full 
polypeptide (block ligation) . 

General methods for obtention and evaluation of 
polynucleotides and polypeptides are described in the 

20 following references: Sambrook et al, Molecular Cloning: A 
Laboratory Manual, 2nd ed, Cold Spring Harbor, N.Y., 1989; 
Current Protocols in Molecular Biology, Edited by Ausubel 
F.M. et al., John Wiley and Sons, Inc. New York; PCR 
Cloning Protocols, from Molecular Cloning to Genetic 

25 Engineering, Edited by White B.A. , Humana Press, Totowa, 
New Jersey, 1997, 490 pages; Protein Purification, 
Principles and Practices, Scopes R.K., Springer-Verlag, New 
York, 3rd Edition, 1993, 380 pages; Current Protocols in 
Immunology, Edited by Coligan J.E. et al . , John Wiley & 

30 Sons Inc., New York which are herein incorporated by 
reference . 

For recombinant production, host cells are transfected with 
vectors which encode the polypeptide, and then cultured in 
35 a nutrient media modified as appropriate for activating 

promoters, selecting transf ormants or amplifying the genes. 
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Suitable vectors are those that are viable and replicable 
in the chosen host and include chromosomal, non- chromosomal 
and synthetic DNA sequences e.g. bacterial plasmids, phage 
DNA, baculovirus, yeast plasmids, vectors derived from 
5 combinations of plasmids and phage DNA. The polypeptide 
sequence may be incorporated in the vector at the 
appropriate site using restriction enzymes such that it is 
operably linked to an expression control region comprising 
a promoter, ribosome binding site (consensus region or 

10 Shine-Dalgarno sequence) , and optionally an operator 

(control element) . One can select individual components of 
the expression control region that are appropriate for a 
given host and vector according to established molecular 
biology principles (Sambrook et al, Molecular Cloning: A 

15 Laboratory Manual, 2nd ed, Cold Spring Harbor, N.Y., 1989; 
Current Protocols in Molecular Biology, Edited by Ausubel 
F.M. et al . , John Wiley and Sons, Inc. New York 
incorporated herein by reference) . Suitable promoters 
include but are not limited to LTR or SV40 promoter, E.coli 

20 lac, tac or trp promoters and the phage lambda P L promoter. 
Vectors will preferably incorporate an origin of 
replication as well as selection markers i.e. ampicilin 
resistance gene. Suitable bacterial vectors include pET, 
PQE70, pQE60, pQE-9, pbs, pDIO phagescript, psiX174, 

25 pbluescript SK, pbsks, pNH8A, pNH16a, pNH18A, pNH46A, 

ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 and eukaryotic 
vectors pBlueBacIII, pWLNEO, pSV2CAT, pOG44, pXTl, pSG, 
pSVK3, pBPV, pMSG and pSVL. Host cells may be bacterial 
i.e. E . coli . Bacillus subtilis , Streptomyces ; fungal i.e. 

30 Aspergillus niqer , Aspergillus nidulins ; yeast i.e. 
Saccharomvces or eukaryotic i.e. CHO, COS. 

Upon expression of the polypeptide in culture, cells are 
typically harvested by centrifugation then disrupted by 
35 physical or chemical means (if the expressed polypeptide is 
not secreted into the media) and the resulting crude 
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extract retained to isolate the polypeptide of interest. 
Purification of the polypeptide from culture media or 
lysate may be achieved by established techniques depending 
on the properties of the polypeptide i.e. using ammonium 
5 sulfate or ethanol precipitation , acid extraction, anion 
or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, 
hydroxyl apatite chromatography and lectin chromatography. 
Final purification may be achieved using HPLC. 

10 

The polypeptide may be expressed with or without a leader 
or secretion sequence. In the former case the leader may 
be removed using post-translational processing (see US 
4,431,739; US 4,425,437; and US 4,338,397 incorporated 
15 herein by reference) or be chemically removed subsequent to 
purifying the expressed polypeptide. 

According to a further aspect, the streptococcus 
polypeptides of the invention may be used in a diagnostic 
20 test for streptococcus infection, in particular S . 

pneumoniae infection. Several diagnostic methods are 
possible, for example detecting streptococcus organism in a 
biological sample, the following procedure may be followed: 
a) obtaining a biological sample from a patient; 
25 b) incubating an antibody or fragment thereof reactive 

with a streptococcus polypeptide of the invention with 
the biological sample to form a mixture; and 
c) detecting specifically bound antibody or bound 

fragment in the mixture which indicates the presence 
30 of streptococcus. 

Alternatively , a method for the detection of antibody 
specific to a streptococcus antigen in a biological sample 
containing or suspected of containing said antibody may be 
35 performed as follows: 

a) obtaining a biological sample from a patient; 
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b) incubating one or more streptococcus polypeptides of 
the invention or fragments thereof with the biological 
sample to form a mixture; and 

c) detecting specifically bound antigen or bound fragment 
5 in the mixture which indicates the presence of 

antibody specific to streptococcus. 

One of skill in the art will recognize that this diagnostic 
test may take several forms, including an immunological 
10 test such as an enzyme-linked immunosorbent assay (ELISA) , 
a radioimmunoassay or a latex agglutination assay, 
essentially to determine whether antibodies specific for 
the protein are present in an organism. 

15 The DNA sequences encoding polypeptides of the invention 
may also be used to design DNA probes for use in detecting 
the presence of streptococcus in a biological sample 
suspected of containing such bacteria. The detection 
method of this invention comprises: 
20 a) obtaining the biological sample from a patient; 
b) incubating one or more DNA probes having a DNA 

sequence encoding a polypeptide of the invention or 
fragments thereof with the biological sample to form a 
mixture ; and 

25 c) detecting specifically bound DNA probe in the mixture 
which indicates the presence of streptococcus 
bacteria . 

The DNA probes of this invention may also be used for 

30 detecting circulating streptococcus i.e. 

S .pneumoniae nucleic acids in a sample, for example using a 
polymerase chain reaction, as a method of diagnosing 
streptococcus infections. The probe may be synthesized 
using conventional techniques and may be immobilized on a 

35 solid phase, or may be labelled with a detectable label. A 
preferred DNA probe for this application is an oligomer 
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having a sequence complementary to at least about 6 
contiguous nucleotides of the streptococcus pneumoniae 
polypeptides of the invention. 

5 Another diagnostic method for the detection of 
streptococcus in a patient comprises: 

a) labelling an antibody reactive with a polypeptide of 
the invention or fragment thereof with a detectable 
label; 

10 b) administering the labelled antibody or labelled 
fragment to the patient; and 
c) detecting specifically bound labelled antibody or 

labelled fragment in the patient which indicates the 
presence of streptococcus. 

15 

A further aspect of the invention is the use of the 
streptococcus polypeptides of the invention as immunogens 
for the production of specific antibodies for the diagnosis 
and in particular the treatment of streptococcus infection. 

20 Suitable antibodies may be determined using appropriate 

screening methods, for example by measuring the ability of 
a particular antibody to passively protect against 
streptococcus infection in a test model. One example of an 
animal model is the mouse model described in the examples 

25 herein. The antibody may be a whole antibody or an 

antigen-binding fragment thereof and may belong to any 
immunoglobulin class. The antibody or fragment may be of 
animal origin, specifically of mammalian origin and more 
specifically of murine, rat or human origin. It may be a 

30 natural antibody or a fragment thereof, or if desired, a 
recombinant antibody or antibody fragment. The term 
recombinant antibody or antibody fragment means antibody or 
antibody fragment which was produced using molecular 
biology techniques. The antibody or antibody fragments may 

35 be polyclonal, or preferably monoclonal. It may be 
specific for a number of epitopes associated with the 
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specific for one. 



Without limiting its scope, the present invention also 
relates to new antigens designated BVH-3, BVH-11, BVH-11-2, 
BVH-28 and BVH-71. The present invention also relates to 
truncated polypeptides comprising fra[gments of the new 
antigens designated BVH-3, BVH-11, BVH-11-2, BVH-28 and 
BVH-71. The present invention also relates to chimeric 
polypeptides comprising fragments of the new antigens 
designated BVH-3, BVH-11, BVH-11-2, BVH-28 and BVH-71. The 
following is a reference table summarizing the relation 
between the antigens of the present invention: 



Family 


Nucleotide SEQ ID 
NO 


Polypeptide SEQ ID 
NO 


BVH-3 


BVH-3 


1, 11 


2 


BVH-3 A 


7 


8 


BVH-3B 


9 


10 


BVH-3 SP63 


15 


16 


BVH-3M 




55 


BVH-3 AD 




56 


L- BVH-3 AD 




57 . 


Newl2 


76 


58 


BVH-3C 




59 


Newl 




64 


New2 




65 


New3 




66 


Newl 5 




78 


BVH-11 


BVH-11 


3, 12 


4 


BVH-11-2 


13 


14 


BVH-11M 




60 


BVH-11A 




61 


BVH-11B also 
referred to as 
NEW13 




62 


BVH-11C 




63 


New4 




67 


New5 




68 



35 



5 
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Family 


Nuc leo t xde SEQ ID 


Polypeptide S£*q i° 




NO 


MU 


New6 






New7 




/ U 


New8 




/ X 


New9 




/ Z 


BVH-11-2M 




73 


NewlO 




74 


Newll 




75 


Newl2 


76 


58 


Newl4 




77 


Newl6 




79 


BVH-28 


BVH-28 


5 


6 


BVH-71 


GBS 


80 


81 


GAS 


82 


83 



EXAMPLE 1 

5 This example illustrates the cloning of pneumoniae genes. 

The coding region of pneumoniae gene BVH-3 (SEQ ID NO: 1) 
and the coding region of S^ pneumoniae gene BVH-28 (SEQ ID 
NO: 5) were amplified by PCR (DNA Thermal Cycler GeneAmp 

10 PCR system 2400 Perkin Elmer, San Jose, CA) from genomic DNA 
of serogroup 6 S^ pneumoniae strain SP64 using the oligos 
that contained base extensions for the addition of 
restriction sites Bglll (AGATCT) and Xbal (TCTAGA) . PCR 
products were purified from agarose gel using a QIAquick gel 

15 extraction kit from QIAgen (Chatsworth, CA) , digested Bglll- 
Xbal (Pharmacia Canada Inc, Baie d'Urfe, Canada), extracted 
with phenol : chloroform and precipitated with ethanol. The 
Superlinker vector pSL301 (Invitrogen, San Diego, CA) was 
digested with Bglll and Xbal and purified from agarose gel 

20 using a QIAquick gel extraction kit from QIAgen (Chatsworth, 
CA) . The Bglll -Xbal genomic DNA fragments were ligated to 
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the Bglll-Xbal pSL301 vector. The ligated products were 
transformed into coli strain DH5a [ f 80 2acZ DM15 endAl 
recAl hsdRll ( r K'*K+) supE44 thi-11" gyrA96 relAl D ( lacZYA- 
argF)U169] (Gibco BRL, Gaithersburg, MD) according to the 

5 method of Simanis (Hanahan, D. DNA Cloning, 1985, D.M. 
Glover (ed) , pp. 109-135). Recombinant pSL301 plasmids 
(rpSL301) containing either BVH-3 or BVH-28 gene were 
purified using a QIAgen kit (Chatsworth, CA) and DNA inserts 
were confirmed by nucleotide sequence analysis (Taq Dye 

10 Deoxy Terminator Cycle Sequencing kit, ABI, Foster City, 
CA) . Recombinant rpSL301 (rpSL301) were digested with the 
restriction enzymes Bglll (AGATCT) and Xhol (CTCGAG) . DNA 
fragments Bglll-Xhol were purified using the QIAquick gel 
extraction kit from QIAgen (Chatsworth, CA) . pET-32c(+) 

15 expression vector (Novagen, Madison, WI) containing the 
thioredoxin-His -Tag sequence was digested with BamHI 
(GGATCC) and Xhol and gel extracted using the QIAquick gel 
extraction kit from QIAgen (Chatsworth, CA) . The Bglll-Xhol 
DNA fragments were ligated to the BamHI-XhoI pET-32c(+) 

20 vector to create the coding sequence for thioredoxin- 

His • Tag-BVH-3 or thioredoxin-His *Tag-BVH-28 fusion protein. 
The ligated products were transformed into E^ coli strain 
DH5a [f80 JacZ DM15 endAl recAl hsdRll ( r K _rn K + ) supE44 thi-11" 
gyrA96 relAl D ( lacZYA-argF) U169] (Gibco BRL, Gaithersburg, 

25 MD) according to the method of Simanis (Hanahan, D. DNA 

Cloning, 1985, D.M. Glover (ed) , pp. 109-135). Recombinant 
pET-32c(+) plasmids were purified using a QIAgen kit 
(Chatsworth, CA) and the nucleotide sequences at the fusion 
sites of thioredoxin-His -Tag and DNA insert were verified by 

30 DNA sequencing (Taq Dye Deoxy Terminator Cycle Sequencing 
kit, ABI, Foster City, CA) . 
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EXAMPLE 2 

This example illustrates the cloning of pneumoniae 
protein genes in CMV plasmid pCMV-GH. 

The DNA coding region of a S. pneumoniae protein was 
inserted in phase downstream of a human growth hormone (hGH) 
gene which was under the transcriptional control of the 
cytomegalavirus (CMV) promotor in the plasmid vector pCMV-GH 
(Tang et al., Nature, 1992, 356 :152). The CMV promotor is 
non functional plasmid in E^ coli cells but active upon 
administration of the plasmid in eukaryotic cells. The 
vector also incorporated the ampicillin resistance gene. 

The coding region of BVH-3 gene (SEQ ID NO: 1) and BVH-28 
gene (SEQ ID NO: 5) were obtained from rpSL301 (see example 
1) using restriction enzymes Bglll (AGATCT) and Xbal 
(TCTAGA) . The digested products were purified from agarose 
gel using the QIAquick gel extraction kit from QIAgen 
(Chatsworth, CA) . The pCMV-GH vector (Laboratory of Dr. 
Stephen A. Johnston, Department of Biochemistry, The 
University of Texas, Dallas, Texas) containing the human 
growth hormone to create fusion proteins was digested with 
Bglll and Xbal and purified from agarose gel using the 
QIAquick gel extraction kit from QIAgen (Chatsworth, CA) . 
The Bglll-Xbal DNA fragments were ligated to the Bglll-Xbal 
pCMV-GH vector to create the hGH -BVH-3 or hGH-BVH-28 fusion 
protein under the control of the CMV promoter. The ligated 
products were transformed into E^_ coli strain DH5a[f80 lacZ 
DM15 endAl recAl hsdR17 ( r K _m lO supE44 thi-11' gyrA96 relAl 
D(IacZYA-argF)U169] (Gibco BRL, Gaithersburg, MD) according 
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to the method of Simanis (Hanahan, D. DNA Cloning, 1985, 
D.M. Glover (ed) , pp. 109-135). The recombinant pCMV 
plasmids were purified using a QIAgen kit (QIAgen, 
Chatsworth, CA) . 

5 

The coding region of BVH-11 gene (SEQ ID NO: 3) was 
amplified by PGR (DNA Thermal Cycler GeneAmp PCR system 2400 
Perkin Elmer, San Jose, CA) from genomic DMA of serogroup 6 
S. pneumoniae strain SP64 using the oligos that contained 

10 base extensions for the addition of restriction sites Bglll 
(AGATCT) and Hindlll (AAGCTT) . The PCR product was purified 
from agarose gel using a QIAquick gel extraction kit from 
QIAgen (Chatsworth, CA) , digested with restriction enzymes 
(Pharmacia Canada Inc, Baie d'Urfe, Canada), extracted with 

15 phenol : chloroform and precipitated with ethanol. The 
pCMV-GH vector (Laboratory of Dr. Stephen A. Johnston, 
Department of Biochemistry, The University of Texas, Dallas, 
Texas) was digested with Bglll and Hindlll and purified from 
agarose gel using the QIAquick gel extraction kit from 

20 QIAgen (Chatsworth, CA) . The Bglll-Hindlll DNA fragment was 
ligated to the Bglll-Hindlll pCMV-GH vector to create the 
hGH-BVH-11 fusion protein under the control of the CMV 
promoter. The ligated products were transformed into E^ coli 
strain DH5a[f80 lacZ DM15 endAl recAl hsdR17 ( r K" m K + ) supE44 

25 thi-11" gyrA96 relAl D(lacZYA-argF)U169] (Gibco BRL, 
Gaithersburg, MD) according to the method of Simanis 
(Hanahan, D. DNA Cloning, 1985, D.M. Glover (ed) , pp. 109- 
135) . The recombinant pCMV plasmid was purified using a 
QIAgen kit (Chatsworth, CA) and the nucleotide sequence of 

30 the DNA insert was verified by DNA sequencing. 
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EXAMPLE 3 

This example illustrates the use of DNA to elicit an immune 
response to S^ pneumoniae antigens. 

5 

A group of 8 female BALB/c mice (Charles River, St-Constant, 
Quebec, Canada) were immunized by intramuscular injection of 
50 (il three times at two- or three-week intervals with 100 
ug of recombinant pCMV-GH encoding the BVH-3, BVH-11 or the 

10 BVH-28 gene in presence of 50 \xg of granulocyte-macrophage 
colony-stimulating factor (GM-CSF)- expressing plasmid pCMV- 
GH-GM-CSF {Laboratory of Dr. Stephen A. Johnston, Department 
of Biochemistry, The University of Texas, Dallas, Texas) . 
As control, a group of mice were injected with 100 \ig of 

15 pCMV-GH in presence of 50 ^g of pCMV-GH-GM-CSF . Blood 
samples were collected from the orbital prior to each 
immunization and seven days following the third injection 
and serum antibody responses were determined by EL ISA using 
thioredoxin-His ■ Tag-S . pneumoniae fusion protein as coating 

20 antigen. DNA immunization with recombinant plasmid pCMV-GH 
encoding the BVH-3, BVH-11 or the BVH-28 S^ pneumoniae 
protein induced antibody reactive against the respective 
recombinant protein. The reciprocal antibody titers, defined 
as the highest serum dilution at which the absorbance values 

25 were 0.1 above the background values, were above 4x1 0 3 . 

EXAMPLE 4 

30 This example illustrates the production and purification of 
recombinant S^ pneumoniae proteins. 
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The recombinant pET plasmids containing the BVH-3, BVH-11 or 
the BVH-28 gene corresponding to the SSQ ZD £30s 1 , SSQ ID 
S30s 3 or the SSQ ID BJOs 5 respectively were transformed by 

5 electroporation (Gene Pulser II apparatus , BIO-RAD Labs, 
Mississauga, Canada) into coli strain AD494 (DE3) (Dara~ 
leu7697 DlacX74 DphoA PvuII phoR DmaIF3 F' [lac* (lacl q ) pro] 
trxB::Kan) (Novagen, Madison, WI) . In this strain of E. 
coli , the T7 promotor controlling expression of the fusion 

10 protein is specifically recognized by the T7 RNA polymerase 
(present on the 1DE3 prophage) whose gene is under the 
control of the lac promotor which is inducible by isopropyl- 
S-d-thio-galactopyranoside (IPTG) . The trans formant 
AD494 (DE3) /rpET was grown at 37°C with agitation at 250 rpm 

15 in LB broth (peptone lOg/L, yeast extract 5g/L, NaCl lOg/L) 
containing 100|ig of ampicillin (Sigma-Aldrich Canada Ltd., 
Oakville, Canada) per ml until the k^oo reached a value of 
0.6. In order to induce the production of the thioredoxin- 
His-Tag-BVH-3, thioredoxin-His-Tag-BVH-11 or thioredoxin- 

20 His-Tag-BVH-28 fusion protein, the cells were incubated for 
2 additional hours in the presence of IPTG at a final 
concentration of 1 mM. Induced cells from a 100 ml culture 
were pelleted by centrifugation and frozen at -70°C. 

25 The purification of the fusion proteins from the soluble 
cytoplasmic fraction of IPTG-induced AD494 (DE3) /rpET was 
done by affinity chromatography based on the properties of 
the His -Tag sequence (6 consecutive histidine residues) to 
bind to divalent cations (Ni 2+ ) immobilized on the His -Bind 

30 metal chelation resin. Briefly, the pelleted cells obtained 
from a lOOmL culture induced with IPTG were resuspended in 
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phosphate-buffered (PBS) :500mM NaCl pH7.1, sonicated and 
spun at 20,000 X g for 20 min to remove debris. The 
supernatant was filtered (0.22pn pore size membrane) and 
deposited on a HiTrap® ImL chelating pre-packed ready-to-use 
5 column (Pharmacia Biotech, Baie d'Urfe, Canada) . The 

thioredoxin-His - Tag-S . pneumoniae fusion protein was eluted 
with 1M imidazole- 500mM NaCl-PBS pH7.1. The removal of the 
salt and imidazole from the sample was done by dialysis 
against PBS at 4°C. The quantities of fusion protein 
10 obtained from the soluble fraction of E^ coli was estimated 
by MicroBCA (Pierce, Rockford, Illinois) . 



EXAMPLE 5 

15 

This example illustrates the protection of mice against 
fatal pneumococcal infection by immunization. 

Groups of 8 female BALB/c mice (Charles River) were 
20 immunized subcutaneous ly three times at three-week intervals 
with either 25 ^g of affinity purified thioredoxin-His -Tag- 
BVH-3 fusion protein in presence of 15 fig of QuilA adjuvant 
(Cedarlane Laboratories Ltd, Hornby, Canada) or, as control, 
with QuilA adjuvant alone in PBS. Blood samples were 
25 collected from the orbital sinus on day 1, 22 and 43 prior 
to each immunization and seven days (day 50) following the 
third injection. One week later the mice were challenged 
with approximately 10 6 CFU of the type 3 S^ pneumoniae 
strain WU2 . Samples of the S^ pneumoniae challenge inoculum 
30 were plated on chocolate agar plates to determine the CFU 
and to verify the challenge dose. Deaths were recorded for 
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a period of 14 days and on day 14 post-challenge, the 
surviving mice were sacrificied and blood samples tested for 
the presence of pneumoniae organisms. The survival data 
are shown in table 1. 

5 

Prechallenge sera were analyzed for the presence of 
antibodies reactive with pneumoniae by standard 
immunoassays. Elisa and immunoblot analyses indicated that 
immunization with recombinant pneumoniae protein produced 
10 in E^ coli elicited antibodies reactive with both, 
recombinant and native pneumococcal protein. 



Table 1. Protection mediated by recombinant BVH-3 protein 



Immunogen 


No. of mice alive : no. of mice 
dead 

14 days post-challenge 


Median day of 
death 


BVH-3 


8 : 0 


>14 


none 


0 : 8 


1 



15 

All mice immunized with BVH-3 recombinant protein survived 
to infection while none of the control mice given adjuvant 
alone survived. There was a significant difference in 
survival between the two groups of mice (P<0.0001, log rank 
20 test for nonparametric analysis of survival curves; 

P=0.0002, Fisher's exact test). All hemocultures from 
surviving mice were negative at day 14 post-challenge. 



25 

EXAMPLE 6 
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This example describes the cloning of BVH-3 and BVH-11 
genes from a variety of pneumoniae strains and the 
molecular conservation of these genes . 

5 Molecular analysis of chromosomal DNA from various S. 
pneumoniae isolates with DNA probes spanning different 
regions of BVH-3 or BVH-11 revealed the presence of one 
BVH-3 gene copy and two BVH-11 gene copies. The two BVH-11 
gene copies are not identical and the genes were 
10 arbitrarily designated BVH-11 <SEQ ID NO:12; ORF at 

nucleotides 45 to 2567) and BVH-11-2 (SEQ ID NO: 13; ORF at 
nucleotides 114 to 2630) . 

The first amino acids of the BVH-3 and BVH-11 coding 
15 regions have the characteristics of leader sequences also 
known as signal peptides. The consensus signal peptidase 
cleavage site L-X-X-C of lipoprotein 

modification/processing sites was present in the sequences. 
Mature BVH-3, BVH-11 and BVH-11-2 proteins from 

20 pneumoniae SP64 have 1019, 821 and 819 amino acids, 

respectively. The regions of pneumoniae genes coding 
for mature BVH-3, termed BVH-3M, (nucleotides 1837 - 4896; 
SEQ. ID. NO: 11), BVH-11M (nucleotides 102-2567; SEQ. ID. 
NO: 12) and BVH-11-2M (nucleotides 171-2630; SEQ. ID. NO: 

25 13), were amplified by PCR ( DNA Thermal Cycler GeneAmp PCR 
system 2400 Perkin Elmer, San Jose, CA) from genomic DNA of 
6 or 7 S^ pneumoniae strains. Serogroup 6 S_^ pneumoniae 
SP64 and serogroup 9 SP63 clinical isolates were provided 
by the laboratoire de la sante publique du Quebec, Sainte- 

30 Anne-de-Bellevue; serotype 4 strain JNR.7/87 was provided 
by Andrew Camilli, Tufts University School of Medicine, 
Boston; Rxl strain, a nonencapsulated derivative of the 
type 2 strain D39 and the type 3 strains A66 and WU2 were 
provided by David E. Briles from University of Alabama, 

35 Birmingham and the type 3 clinical isolate P4241 was 

provided by the centre de recherche en inf ectiologie du 
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centre hospitalier de 1 ' universite Laval, Sainte-Foy. The 
sets of oligonucleotide primers OCRR479-OCRR480 ; HAMJ160- 
OCRR488 and HAMJ160-HAMJ186 , that contained base extensions 
for the addition of restriction sites were used for the 
amplification of BVH-3 , BVH-11 and BVH-11-2 gene, 
respectively, with the exception of BVH-11 gene from SP64 
strain which was amplified using the set of primers 
consisting of HAMJ487 and OCRR488. Primer sequences are 
listed below (Table 2) . PCR products were purified from 
agarose gel using a QIAquick gel extraction kit from QIAgen 
(Chatsworth, CA) and digested Bglll-Xbal or Bglll-Hindlll 
(Pharmacia Canada Inc, Baie d'Urfe, Canada) . Digestions 
were cleaned using a QIAquick PCR purification kit from 
QIAgen (Chatsworth, CA) . The PCR products were ligated to 
the Bglll-Xbal or Bglll-Hindlll pSL3 01 vector. The ligated 
products were transformed into coli strain DH5a [080 
lacZ AM15 endAl recAl hsdR17 ( r K*Y) supE44 thi-lX" gyrA96 
relAl A(lacZYA-argF)U169] (Gibco BRL, Gaithersburg, MD) 
according to the method of Simanis (Hanahan, D. DNA 
Cloning, 1985, D.M. Glover (ed) , pp. 109-135). Recombinant 
pSL301 plasmids (rpSL301) containing BVH-3 , BVH-11 or 
BVH11-2 were purified using a QIAgen kit (Chatsworth, CA) 
and DNA inserts were sequenced (Taq Dye Deoxy Terminator 
Cycle Sequencing kit, ABI , Foster City, CA) . The figures 11 
and 12 depict the consensus sequence established from the 
BVH-3 , and BVH-11 deduced amino acid sequences, 
respectively. Comparison of BVH-3 protein sequences 
revealed 99 to 100% identity of sequences for all strains 
with the exception that BVH-3 from serogroup 9 SP63 strain 
(SEQ. ID. NO: 15 and SEQ. ID. NO: 16) misses a stretch of 
177 amino acids corresponding to residues 244 to 420 on 
BVH-3 'protein sequence of pneumoniae SP64. Analysis of 
sequences of additional serogroup 9 strains revealed BVH-3 
molecule having the same deletion in 3 out of 4 strains 
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thus suggesting that the 3 strains are members of a S. 
pneumoniae serogroup 9 clone. 

Comparison of 13 BVH-11 nucleotide sequences obtained from 

5 7 S_;_ pneumoniae strains, revealed that the nucleotide 

sequences are very similar. Computer analysis (MacVector, 
Clustal W 1.4) using multiple alignment of the predicted 
BVH-11 protein sequences revealed that these sequences were 
75% identical and 82 % homologous on a length of 834 amino 

10 acids. Pairwise alignment revealed 80 to 100% identity 
(Figure 13) . The sequences showed great similarity in 
overall organization. Variability in the primary sequence 
of these proteins is almost restricted to the last 125 
amino acids in the C-terminal portion of the proteins. This 

15 region constitutes a domain. Close examination of this 
domain revealed two groups of sequences. The first 9 
sequences from the figure 13 belong to one group while the 
last 4 sequences belong to another group. A 39% identity 
value is obtained when the domain sequences of the 13 

20 proteins are compared (MacVector, Clustal W 1.4). The 
identity value increased to more than 92% when sequences 
belonging to a same group are compared. 

25 EXAMPLE 7 

This example illustrates the homology of portions of BVH-3 
and BVH-11 genes. 

30 Molecular analysis with DNA probes derived from BVH-3 and 
BVH-11 genes indicated that BVH-3 and BVH-11 were related. 
In dot blot hybridization studies, DNA probe consisting of 
either, BVH-3 or BVH-11, gene sequence hybridized to both, 
BVH-3 and BVH-11 genes thus indicating that BVH-3 and BVH- 

35 11 genes shared homologous sequences. Comparison of 

sequences revealed that the ORFs and the proteins were 43 
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and 33% identical, respectively. Closer examination 
revealed that the region corresponding to amino acids 1 to 
225 in BVH-3 and 1 to 228 in BVH-11 were 73 and 75% 
identical at the DNA and protein level, respectively. In 
5 contrast, the 3' regions corresponding to amino acids 226 
to 1039 from BVH-3 and amino acids 229-840 from BVH-11 were 
only 34 and 22% identical at the DNA and protein level, 
respectively. Thus the 5' termini of BVH-3 and BVH-11 
genes appear to contain highly conserved sequences while 

10 the remaining parts of the genes are highly divergent. 
These results suggest that BVH-3 and BVH-11 might share 
similar functions mediated by sequences present in the 
conserved region whereas BVH-3- and BVH-ll-specif ic 
functions might be mediated by sequences in the divergent 

15 region. 

EXAMPLE 8 

20 This example describes the cloning of truncated BVH-3 , BVH- 
11 and. BVH-11 -2 genes by polymerase chain reaction (PCR) 
and the expression of truncated BVH-3 and BVH-11 molecules. 

Gene fragments were amplified by PCR using pairs of 
25 oligonucleotide engineered to amplify fragments spanning 

the BVH-3 (SEQ ID NO: 1 and SEQ ID NO: 11), BVH-11 (SEQ ID 
NO: 3 and SEQ ID NO: 12) or BVH-11-2 (SEQ ID NO: 13) gene 
from pneumoniae strain SP64. Each of the primers had a 
restriction endonuclease site at the 5' end, thereby 
30 allowing directional in- frame cloning of the amplified 

product into the digested plasmid vector (Tables 2 and 3). 
PCR-amplif ied products were digested with restriction 
endonucleases and ligated to either linearized plasmid 
PSL301 (see example 1), pCMV-GH (see example 2) or pET 
35 (Novagen, Madison, WI) expression vector digested likewise 
or digested with enzymes that produce compatible cohesive 
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ends. Recombinant pSL301 and recombinant pCMV-GH plasmids 
were digested with restriction enzymes for the in-frame 
cloning in pET expression vector. Clones were first 
stabilized in coli DH5a before introduction into coli 
5 BL21(XDE3) or AD494 (ADE3 ) for expression of truncated BVH- 
3 or BVH-11 molecules. Each of the resultant plasmid 
constructs was confirmed by nucleotide sequence analysis. 
The recombinant proteins were expressed as N-terminal 
fusions with the thioredoxin and His-tag or as C-terminal 

10 fusions with an His-tag. The expressed recombinant 

proteins were purified from supernatant fractions obtained 
from centrifugation of sonicated IPTG-induced E^. coli 
cultures using a His-Bind metal chelation resin (QIAgen, 
Chatsworth, CA) . The gene products generated are listed in 

15 the table 3. The gene products corresponding to the N- 
terminal region including the signal sequence are 
designated as Lipidated-proteins or lipoproteins (L- 
proteins) . The gene products corresponding to the N- 
terminal region lacking the signal sequence are identified 

20 as protein without signal sequence (w/o ss) . 

Table 2. List of PCR oligonucleotide primers 

25 



Primer 


SEQ. 
ID. 


Sequence 5* - 3' 


Nucleotide 
position 


Restric- 
tion 
sites 


OCRR 479 


17 


cagtagatctgtgcctatgcactaaac 


SEQ ID I :61- 
78 


BglH 


OCRR 480 


18 


gatctctagactactgctattccttacgctatg 


SEQ 

ID 11 :4909- 
4887 


Xbal 


OCRR 497 


19 


atcactcgagcattacctggataatcctgt 


SEQ 

ID 1:1525- 
1506 


Xhol 


OCRR 498 


20 


ctgctaagcttatgaaagatttagat 


SEQ 

ID 1 :1534- 
1548 


HindHI 
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OCRR499 


21 


gatactcgagctgctattccctac 


SEQ 

ID 11 :4906- 
4893 


Xhol 


HAMJ 172 


22 


gaatctcgagttaagctgctgctaattc 


SEQ ID 1 : 

A7^_AA 1 
O/D-OOl 


Xhol 


HAMJ247 


23 


gacgctcgagcgctatgaaatcagataaattc 


SEQ ID 

1 .oil /-jUVO 


Xhol 


HAMJ 248 


24 


gacgctcgagggcattacctggataatcctgttcatg 


SEQ ED 

1 : 1527-1501 


Xhol 


HAMJ 249 


25 


cagtagatctcttcatcatttattgaaaagagg 


SEQ ID 11 : 

1749-1771 


Bglll 


HAMJ 278 


26 


ttatttcttccatatggacttgacagaagagcaaattaag 


SEQ ID 

1 . 1 A 1 A 1 jlOT 

1 ; 1414-1437 


Ndel 


HAMJ 279 


27 


cgccaagcttcgctatgaaatcagataaattc 


SEQ ID 

1 :3 117-3096 


HindlU 


HAMJ 280 


28 


cgccaagcttctccacaatataagtcgattgatt 


SEQ ED 

1 : 2400-2377 


HindlH 


HAMJ 281 


29 


ttatttcttccatatggaagtacctatcttggaaaaagaa 


SEQ ID 

1 : 2398-2421 


Ndel 


HAMJ 300 


30 


ttatttcttccatatggtgcctatgcactaaaccagc 


SEQ ID 1 :62- 
82 


Ndel 


HAMJ 313 


31 


ataagaatgcggccgcctccacaatataagtcgattgatt 


SEQ ID 

1 :2400-237 / 


Nod 


OCRR 487 


32 


cagtagatctgtgcttatgaactaggtttgc 


SEQ ID 3 :58- 
79 


Bglll 


OCRR 488 


33 


gatcaagcttgctgctacctttacttactctc 


SEQ ID 

12 :2577-255o 


Hindni 


HAMJ 171 


34 


ctgagatatccgttatcgttcaaacc 


SEQ ID 

3 .1060-1075 


EcoRV 


HAMJ 251 


35 


ctgcaagcttttaaaggggaataatacg 


SEQ ID 

*5 . 1 f\Cf\ 1 C\A C 

3 : 1059- 1043 


Hindin 


HAMJ 264 


36 


cagtagatctgcagaagccttcctatctg 


SEQ ID 3 :682- 

700 


Bglll 


HAMJ 282 


37 


tcgccaagcttcgttatcgttcaaaccattggg 


SEQ ID 

1 . 1 AAA 1 AQ 1 

3 :10oO-10ol 


HindUI 


HAMJ 283 


38 


ataagaatgcggccgccttactctcctttaataaagccaat 
agtt 


SEQ ID 

3 : 25 20-2492 


Ndel 


HAMJ 284 


39 


catgccatggacattgatagtctcttgaaacagc 


SEQ ID 3:856- 
oSO 


Ncol 


HAMJ 285 


40 


cgccaagcttcttactctcctttaataaagccaatag 


SEQ ID 


HindHI 


HAMJ 286 


41 


cgacaagcttaacatggtcgctagcgttacc 


SEQ ID 
3:2139-2119 


Hindni 


HAMJ 287 


42 


cataccatgggcctttatgaggcacctaag 


SEQ ID 

3 :2014-2034 


Ncol 


HAMJ 288 


43 


cgacaagcttaagtaaatcttcagcctctctcag 


SEQ ID 

3 :2376-2353 


HindUI 
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HAMJ289 


44 


gataccatggctagcgaccatgttcaaagaa 


SEQ ID 
3:2125-2146 


Ncol 


HAMJ290 


45 


cgccaagcttatcatccactaacttgactttatcac 


SEQ ID 
3:1533-1508 


Hindm 


HAMJ291 


46 


cataccatggatattcttgccttcttagctccg 


SEQ ID 
3:1531-1554 


Ncol 


HAMJ301 


47 


catgccatggtgcttatgaactaggtttgc 


SEQ ID 3 :59- 
79 


Ncol 


HAMJ 302 


48 


cgccaagctttagcgttaccaaaaccattatc 


SEQ ID 
3:2128-2107 


Hindlll 


HAMJ 160 


49 


gtattagatctgttcctatgaacttggtcgtcacca 


SEQ ID 13 : 
172-196 


BglH 


HAMJ 186 


50 


cgcctctagactactgtataggagccgg 


SEQ ID 13: 
2460-2443 


Xbal 


HAMJ 292 


51 


catgccatggaaaacatttcaagccttttacgtg 


SEQ ID 11: 
754-778 


Ncol 


HAMJ 293 


52 


cgacaagcttctgtataggagccggttgactttc 


SEQ ID 11 : 
2457-2434 


Hindlll 


HAMJ 294 


53 


catgccatggttcgtaaaaataaggcagaccaag 


SEQ ID 11 : 
2038-2062 


Ncol 


HAMJ 297 


54 


catgccatggaagcctattggaatgggaag 


SEQ ID 11: 
622-642 


Ncol 
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EXAMPLE 9 

This example describes the isolation of monoclonal 
antibodies (Mabs) and the use of Mabs to characterize BVH- 
5 3, BVH-11 and BVH-11-2 protein epitopes. 

Female BALB/c mice (Charles River) were immunized 
subcutaneous ly with BVH-3 , BVH-11 or BVH-11-2 gene products 
from Sj_ pneumoniae strain SP64 in presence of 15 /ig of 

10 QuilA adjuvant (Cedar lane Laboratories Ltd, Hornby, 
Canada) . One set of mice (fusion experiment 1) were 
immunized on day 1 and 14 with 25 tig of affinity purified 
thioredoxin-His»Tag-BVH-3M fusion protein. A second group 
of mice (fusion experiment 2) were immunized three times at 

15 three-week intervals with 25 Mg of affinity purified 

thioredoxin-His»Tag-BVH-llM. A third group of mice (fusion 
experiment 3) were immunized on day 1 and day 15 with 25 iig 
of affinity purified thioredoxin-His»Tag-BVH-ll-2M fusion 
protein. A fourth group of mice (fusion experiment 4) were 

20 immunized on day 1 with 25 iig of affinity purified 

thioredoxin-His«BVH-llB fusion protein and boosted by 
intravenous injection on day 16 and on day 37 with 
recombinant BVH-11B in PBS. Three to "four days before 
fusion, mice were injected intravenously with 25 iig of the 

25 respective antigen suspended in PBS alone. Hybridomas were 
produced by fusion of spleen cells with nonsecreting SP2/0 
myeloma cells as previously described by J. Hamel et al. 
[J. Med. Microbiol., 23, ppl63-170 (1987)]. Culture 
supernatants of hybridomas were initially screened by 

30 enzyme-linked-immunoassay according to the procedure 

described by Hamel et al. (Supra) using plates coated with 
preparations of purified recombinant proteins or 
suspensions of heat-killed S^. pneumoniae cells. Positive 
hybridomas selected on the basis of EL ISA reactivity with a 



53 



6NSDOCID: <WO 0039299A2_I_> 



WO 00/39299 



PCT/CA99/01218 



variety of antigens were then cloned by limiting dilutions, 
expanded and frozen. 

Hybridomas were tested by EL ISA or Western immunoblotting 
5 against BVH-3 and BVH-11 gene products in order to 

characterize the epitopes recognized by the Mabs. BVH-3 
and BVH-11 shared common epitopes with 6 Mabs (H3-1-F9, H3- 
1-D4, H3-1-H12, H11-1-E7, H11-1-H10 and Hll-1 . 1-G11 ) 
showing reactivities with both proteins (Table 4) . BVH-11 
10 and BVH-11-2 molecules from £L_ pneumoniae SP64 shared 

common epitopes not present on BVH-3 with Mabs (3A1, 13C11, 
10H10, 1D8, 10G9, 10A2, 3E8, 10D7, 2H7 and 6H7 ) reactive 
with both, BVH-11 and BVH-11-2, recombinant proteins (Table 
5) . 



15 

Table 4. Reactivity of BVH-3-irnmunoreactive Mabs with a 
panel of BVH-3 and BVH-11 gene products 



MAbs 


a . Immunoreactivity with 


BVH-3M 
21-1039 


BVH-3 A 
21-509 


BVH-3B 
512-1039 


BVH-3C 
21-225 


NEW2 
472-800 


NEW 3 

800-1039 


BVH-11M 
20-840 


H3-1-F9 




+ 




+ 






+ 


H3-1-D4 


+ 


+ 




+ 






+ 


H3-1-H12 


+ 


+ 




+ 






+ 


H3-2-G2 


+ 


+ 












H3-3-A1 


+ 


+ 












H3-4-D3 


+ 




+ 






+ 




H11-1-E7 


+ 


+ 




+ 






+ 


Hll-1- 
H10 


+ 


+ 




+ 






+ 


Hll- 
1.1-G11 


+ 

1 


+ 




+ 


+ 




+ 
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Table 5. Reactivity of Mabs raised against BVH-11-2 
protein from ^ pneumoniae strain SP64 with a panel of BVH^ 
11 gene products 



Mabs' 


b.Immunoreactivity with 


c.BVH-11 products 


d.BVH-U-2 products 


BVH-11M 
20-840 


NEW8 
286-511 


NEW9 
511-713 


BVH-11B 
354-840 


BVH-11-2 
20-838 


NEW1C 
271-838 


rail 

699-838 


NEW14 
227-699 


3A1 


+ 


+ 




+ 


+ 


+ 




+ 


13C1 


+ 


+ 


+ 


+ 


+ 


+ 




+ 


10H10 


+ 


+ 


i 


+ 


+ 


+ 




+ 


1D8 


+ 


+ 


- 


+ 


+ 


+ 




+ 


10G9 


+ 






+ 


+ 


+ 




+ 


10A2 


+ 






+ 


+ 


+ 




+ 


3E8 


+ 






+ 


+ 


+ 




+ 


10D7 


+ 






+ 


+ 


+ 




+ 


2H7 


+ 








+ 








6H7 


+ 








+ 








3A4 










+ 


+ 


+ 




14H6 










+ 


+ 


+ 




7G2 










+ 


+ 




+ 


13H10 










+ 






+ 


7E8 










+ 








7H6 










+ 









a Mabs listed in this table were not reactive with recombinant BVH-3 molecule 
5 



The results obtained from the immunoreactivity studies of 
the Mabs (Table 4 and Table 5) are in agreement with the 
protein sequences derived from the respective gene 
sequences. Indeed the Mabs cross-reactive with BVH-3 and 
10 BVH-11 molecules recognized BVH-3C protein corresponding to 
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This example illustrates the simultaneous expression of 
BVH-3 and BVH-11 gene products by S^ pneumoniae . 

A standard Western blot technique was used to investigate 
5 whether BVH-3 and BVH-11 genes were expressed in S. 

pneumoniae. pneumoniae strain SP64 and SP63 were grown 

overnight at 37°C in 5% C0 2 on chocolate agar plates, 
bacteria were suspended in PBS and heat-killed at 56°C for 
20 min. For the preparation of antigens, suspensions of 

10 pneumoniae were treated with sample buffer containing SDS 
and 2-mercaptoethanol for 5 min at 100°C. Pneumococcal 
protein antigens were resolved by SDS-PAGE electrophoresis 
according to the method of Laemmli [Nature, 227, pp. 680- 
685 (1970)]. After SDS-PAGE, the proteins were transferred 

15 electrophoretically from the gel to nitrocellulose paper by 
the method of Towbin [Proc. Natl. Acad. Sci. USA, 76, pp. 
4350-4354 (1979)] and probed with mouse antiserum or 
monoclonal antibodies. The detection of antigens reactive 
with the antibodies was performed by indirect enzyme- 

20 immunoassay using conjugated-anti-mouse immunoglobulins and 
a colour substrate. When antiserum raised to recombinant 
BVH-3 was tested against pneumoniae SP64 antigens, two 
reactive bands having apparent molecular masses of 127 kDa 
and 99 kDa were detected. Bands having the same apparent 

25 molecular masses were also detected when Mabs H3-1-F9, H3- 
1-D4, H3-1-H12, H11-1-E7, H11-1-H10 and Hll-1 . 1-G11 were 
used individually as immunological probes. In contrast, 
Mabs specific for the BVH-3 molecule detected the 127 kDa 
band only and Mabs specific for BVH-11 detected the 99 kDa 

30 band only thus confirming the identity of the 127 and 99 
kDa bands as BVH-3 and BVH-11, respectively. These studies 
provide evidence that BVH-3 and BVH-11 proteins are 
simultaneously present on S^ pneumoniae . Moreover, the 
results are consistent with our previous observations that 

35 BVH-3 and BVH-11 possess epitopes that are common to both 
proteins and epitopes that are exclusive to either protein. 
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In S^ pneumoniae SP64, mature BVH-3, BVH-11 and BVH-11-2 
are proteins of 1019, 821 and 819 amino acids with 
predicted molecular mass of 112.5 kDa, 92.4 kDa, and 91.7 
5 kDa, respectively. Although there is a discrepancy between 
the molecular mass predicted from the sequence and the 
molecular mass calculated on SDS-PAGE, BVH-3 can be 
distinguished from BVH-11 by its higher molecular mass. 
Moreover, BVH-3 molecules from pneumoniae strain SP63 
10 have an apparent molecular mass of 112 kDa in SDS-PAGE 

compared to 127 kDa for BVH-3 of SP64 strain. This data is 
consistent with the deletion of a stretch of 177 amino acid 
residues in BVH-3 of S^ pneumoniae strain SP63. 

15 

EXAMPLE 11 

This example describes the protection conferred in 
experimental infection of mice vaccinated with recombinant 
20 BVH-3 or BVH-11 gene products. 

Groups of 7 or 8 female BALB/c mice (Charles River) were 
immunized subcutaneous ly three times at three-week 
intervals with either affinity purified thioredoxin- 

25 His»Tag-BVH-3M fusion protein, affinity purified 

thioredoxin-His«Tag-BVH-llM fusion protein or, as control, 
with QuilA adjuvant alone in PBS. Twelve to 14 days 
following the third immunization, the mice were challenged 
intravenously with pneumoniae WU2 strain or intranasally 

30 with P4241 strain. Samples of the S^ pneumoniae challenge 
inoculum were plated on chocolate agar plates to determine 
the CFU and to verify the challenge dose. The challenge 
dose was approximately 10* CFU. Deaths were recorded for a 
period of 14 days and on day 14 post-challenge, the 

35 surviving mice were sacrificed and blood samples tested for 
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the presence of £L_ pneumoniae organisms. The survival data 
are shown in Tables 6 and 7 . 



5 



Table 6. Protection mediated by recombinant BVH-3M and 
BVH-11M proteins in experimental infection with virulent S. 
pneumoniae WU2 



Experiment 


Immunogen 


Alive : dead* 


Median days alive 


1 


BVH-3M 
none 


8 : 0 
0 : 8 


>14 

1 


2 


BVH-11M 
none 


8 : 0 
0 : 8 


>14 
1 



10 



14 post-challenge. 



the number of mice dead on day 



Table 7. Protection mediated by recombinant BVH-3M and 
BVH-11M proteins in experimental pneumonia with- virulent £L_ 
15 pneumoniae P4241 



Experiment 


Immunogen 


Alive 


dead* 


Median day alive 


1 


BVH-3M 


6 


1 




>14 




none 


1 


7 




4.5 


2 


BVH-3M 


8 


. 0 




>14 




BVH-11M 


8 


0 




>14 




none 


0 


8 




4 



The number of mice alive : the number of mice dead on day 
14 post-challenge. 



All mice immunized with recombinant BVH-3M or BVH-11M 
20 protein survived to infection with WU2 while none of the 

control mice given adjuvant alone survived. All except one 
mice immunized with recombinant BVH-3M or BVH-11M protein 
survived to infection with P4241 while only one control 
mice given adjuvant alone survived. All hemocultures from 



58 



BNSDOCID; <WO_0039299A2J_> 



WO 00/39299 



PCT/CA99/01218 



surviving mice were negative at day 14 post-challenge. 
These results clearly indicate that both, BVH-3M and BVH- 
11M, elicit protective ant i -pneumococcal immune responses 
in mice. The fact that these proteins are highly conserved 
5 among pneumoniae isolates emphasize the potential of 
BVH-3 and BVH-11 as universal vaccine candidates. Indeed, 
the BVH-3 and BVH-11 proteins from serogroup 6 S . 
pneumoniae strain SP64 elicited protection against 
pneumococcal infections with strains of different capsular 
10 serotypes . 

Ideally, a vaccine that could protect against pneumococcal 
disease, could protect against meningitis, otitis media, 
bacteremia and pneumonia. BVH-3 and BVH-11 were protective 
15 against lethal systemic- and pneumonia-infection models 

thus suggesting that, in humans, BVH-3- and BVHll-protein- 
based vaccines could reduce the incidence of a wide 
spectrum of disease caused by virtually all pneumoniae 
independently of the capsular serotype. 

20 

Data from Tables 6 and 7 clearly demonstrate that BVH-3 and 
BVH-11 were, both, protection-eliciting molecules of S . 
pneumoniae . It was not known, however, whether protection 
can be mediated by specific sequences that were not shared 

25 on BVH-3 and BVH-11 molecules. Groups of female BALB/c 
mice (Charles River) were immunized subcutaneous ly three 
times at three-week intervals with either affinity purified 
thioredoxin-His«Tag- BVH-3 AD, -BVH-3B or -BVH-3C fusion 
protein in presence of 15 tig of QuilA adjuvant (Cedarlane 

30 Laboratories Ltd, Hornby, Canada) . Control mice were 
immunized with QuilA adjuvant alone in PBS or affinity 
purified thioredoxin-His«Tag or thioredoxin-His«Tag- fusion 
protein (His-Thio) in presence of QuilA. 

35 To determine the protective ability of a set of truncated 
proteins, termed NEW4, NEW5, NEWS , NEW7, NEW8, NEW9, NEW10, 
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NEW11, NEW14 and BVH-11B, groups of female BALB/c mice 
(Charles River) were immunized subcutaneously two times at. 
three-week intervals with 25 ug of either affinity purified 
His»Tag-fusion protein in presence of 15 nq of QuilA 
5 adjuvant. Ten to 14 days following the last immunization, 
the mice were challenged with virulent pneumoniae . Our 
results indicate that, BVH-3B, a truncated BVH-3 molecule 
consisting of amino acids 512-1039, elicited protection 
against the mouse-virulent strains WU2 and P4241. 

10 Similarly, BVH-11B, NEW4 and NEWS molecules, three 

truncated BVH-11 molecules consisting of amino acids 354- 
840, amino acids 286-840 and amino acids 286-713, 
respectively, elicited protection against experiment 
intravenous challenge with WU2 and intranasal challenge 

15 with P4241. Moreover, vaccination with NEW10 and NEW14, 
consisting of amino acids 272-838 and amino acids 227-699 
from BVH-11-2 molecule also resulted in protection against 
death with the pneumococcal strains. These results 
indicate that the region comprising 428 amino acids 

20 extending from amino acids 286-713 and amino acids 272-699 
on pneumoniae SP64 BVH-11 and BVH-11-2 protein 
sequences, respectively, contains protective epitopes. 
This region is highly conserved with a global 91% identity 
and 94% homology among thirteen BVH-11 protein sequences. 

25 
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Table 8. Evaluation of protection elicited by vaccination 
of mice with BVH-3 and BVH-11 gene products 





Challenge with WU2 


Challenge with P4241 


Experiment 


Immunogen 


Alive : dead 3 


Median day 


Alive : dead 


Median day 








alive 




alive 


r 


None 


0:8. 


1.5 


1 : 7 


4.5 




NEW4 


8 : 0 


>14 


8 : 0 


>14 




NEWS 


8 : 0 


>14 


8 : 0 


>14 




NEW7 


0 : 8 


2 


0 : 8 


5 




BVH-11M 


8 : 0 


>14 


8 : 0 


>14 


2° 


None 


0 : 8 


1 


0 : 8 


4 




NEW5 


8 : 0 


>14 


8 : 0 


>14 




NEW8 


0 : 8 


1.5 


0 : 8 


5.5 




NEW 9 


3 : 5 


3.5 


2 : 6 


7 




BVH-11M 


8 : 0 


>14 


8 : 0 


>14 


3 b 


None 


0 : 8 


1 


0 : 8 


4 




NEW 6 


0 : 8 


1 


4 : 4 


10. 5 C 




NEW10 


8 : 0 


>14 


8 : 0 


>14 




NEW11 


0 : 8 


1.5 


1 : 7 


6 




BVH-11M 


8 : 0 


>14 


8 : 0 


>14 


4 b 


None 


0 : 8 


2 


0 : 8 


4 




BVH-11B 


7 : 1 


>14 


8 : 0 


>14 




NEW14 


8 : 0 


>14 


8 : 0 


>14 


5 


His-Thio 


0 : 8 


2 








BVH-3 AD 


1 : 7 


2.5 








BVH-3B 


5 : 3 


>14 






6 


His-Thio 


0 : 8 


1 








BVH-3C 


0 : 8 


1 







The number of mice alive : the number of mice dead on day 

14 post-challenge . 
5 b The WU2 challenge dose was 10 5 CFU. 

Mice living longer than 14 days were assigned a survival 

time of 14 days for the determination of median values. 
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EXAMPLE 12 

This example described the cloning and expression of a 
5 chimeric gene encoding for a chimeric polypeptide 

corresponding to the carboxy- terminal region of BVH-3 in 
fusion at the C end to the carboxy- terminal region of BVH- 
11 and the additive protection observed after vaccination 
with a chimeric polypeptide. 

10 

It is clear from the studies described above that BVH-3 and 
BVH-11 are serologically distinct molecules simultaneously 
present on pneumoniae . The results of immunological 
studies of mice indicate that both proteins are good 

15 vaccine candidates. These proteins have the potential to 
provide protection against all pneumococci, regardless of 
serotype. Even though the two proteins share epitopes and 
sequences, they have different characteristics and may 
serve different biological functions. Thus, immunization 

20 against the two proteins may provide a higher level of 
protection than that imparted by each individually. To 
examine this, several avenues where full-length or 
truncated BVH-3 and BVH-11 are administered in combination 
or in conjugation can be explored. Here we describe the 

25 genetic engineering of a BVH-3-BVH-11 fusion gene and 
protein, termed NEW12 (SEQ ID NO: 76 and SEQ ID NO:58, 
respectively) , and the potential use of NEW12 protein as a 
vaccine . 

30 BVH-3 and BVH-11 gene fragments corresponding to the 3 ' end 
of the genes were amplified by PCR using pairs of 
oligonucleotides engineered to amplify fragments spanning 
nucleotides 1414 to 3117 (SEQ ID NO: 1) and nucleotides 1060 
to 2520 (SEQ ID NO: 3) from Pneumoniae strain SP64 BVH-3 

35 and BVH-11 genes, respectively. The primers used, HAMJ278 
and HAMJ279; HAMJ282 and HAMJ283 had a restriction 
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endonuclease site at the 5' end, thereby allowing 
directional in- frame cloning of the amplified product into 
the digested pET21b(+) plasmid vector (Table 2). PCR- 
amplified products were digested with restriction 
5 endonucleases and ligated to linearized plasmid pET21b(+) 
vector digested likewise. The resultant plasmid constructs 
were confirmed by nucleotide sequence analysis. The 
recombinant pET21b(+) plasmid containing the Ndel-Hindlll 
BVH-3 PCR product was linearized by digestion with the 

10 restriction enzymes Hindi 1 1 and NotI for the in-frame 

cloning of the Hindlll-NotI DNA fragment obtained from the 
recombinant pET21(+) vector containing the BVH-11 gene 
fragment. Clones were first stabilized in E^ coli DH5a 
before introduction into E_^ coli BL21(XDE3) for expression 

15 of a chimeric pneumococcal protein molecule. The 

recombinant chimeric polypeptide, termed NEW 12, was 
expressed as C- terminal fusion with an His- tag. The 
expressed recombinant NEW 12 protein was purified from 
supernatant fractions obtained from centrifugation of 

20 sonicated IPTG-induced E^_ coli cultures using a His-Bind 
metal chelation resin (QIAgen, Chatsworth, CA) . 

According to the same procedure described above, it is 
possible to construct other chimeric polypeptides, as a 

25 result of a simultaneous expression of New 1 and New 4, New 
1 and New 5, New 1 and New 10, or New 1 and New 14. The 
construction can be with New 1 upstream or downstream of 
New 4, New 5, New 10, BVH-11B or New 14. It is also 
possible to construct other chimeric polypeptides as a 

30 result of a simultaneous expression of more than two 

fragments of either genes of BVH-3, BVH-11 or BVH-11-2; 
i 

Groups of 8 female BALB/c mice (Charles River) were 
immunized subcutaneously two times at three-week intervals 
35 with 25 of either affinity purified His«Tag- fusion NEW1, 
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BVH-11B or NEW12 protein in presence of 15 ug of QuilA 
adjuvant. Ten to 14 days following the last immunization, 
the mice were challenged with virulent pneumoniae . As 
demonstrated before, NEW1 and BVH-11B molecules comprising 
5 amino acids 472 to 1039 from BVH-3 protein and amino acids 
354-840 from BVH-11 protein, respectively, correspond to 
portions of the proteins capable of eliciting a protective 
immune response. To determine if a chimeric polypeptide 
would significantly improve the protection compared with 

10 those seen for the individual counterparts, the challenge 
dose was adjusted in a manner that protection was not 
expected with NEW1 and BVH-11B molecules. Interestingly, 
the chimeric NEW12 protein, elicited protection against the 
mouse-virulent strains WU2 and P4241. Seven out of 8 mice 

15 immunized with NEW12 were still alive 10 days after the 

challenge while 28 out of 32 mice immunized with NEW1, BVH- 
11B, BVH-3M or adjuvant alone were dead by five days post- 
challenge. Thus, vaccination of mice with NEW12 provided 
the highest degree of protection against WU2 challenge. 

20 These results indicate that immunization with a chimeric 
polypeptide and possibly a combination of BVH-3 and BVH-11 
gene products can provide additional protection to that 
obtained by administration of BVH-3 or BVH-11 antigens 
alone. 

25 

Table 9. Evaluation of protection elicited by vaccination 
of mice with the chimeric NEW12 molecule 





Challenge with WU2 


Challenge with P4241 


Immunogen 


Alive : dead 3 


Median day 


Alive : dead 


Median day 






alive 




alive 


None 


0 : 8 


1 


0 : 8 


5 


NEW1' 


2 : 6 


2 


1 : 7 


8 


BVH-11B 


1 : 7 


3.5 


8 : 0 


>14 


NEW12 


6 : 2 


>14 


7 : 1 


>14 


BVH-3M 


1 : 7 


3 


8 : 1 


>14 
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EXAMPLE 13 

5 

This example illustrates the identification of additional 
BVH-3 and BVH-11 related sequences in Streptococcus species 
other than pneumoniae . 

10 It was previously shown that BVH-3, BVH-11 and BVH-11-2 are 
a family of related proteins sharing common sequences. 
Homology searches were performed with the nucleotide 
sequence from the conserved region of these genes and 
compared with GenBank and EMBL sequences using FASTA. The 

15 most significant homology was observed with a 2.469-kb gene 
coding for a calculated 92-kDa protein (SEQ ID NO: 81) of 
unknown function in S^ agalactiae also called group B 
streptococcus or GBS. The gene was designated BVH-71. A 
protein demonstrating 99.2% identity and 99.5% similarity 

20 with that of GBS was also identified in pyogenes also 

called group A streptococcus or GAS (SEQ ID NO: 83) . The 5' 
region of the BVH-71 sequences <SEQ ID NO: 80 and SEQ ID 
NO: 82), spanning nucleotides 1 to 717, demonstrated 58 and 
60% identity with the conserved regions of BVH-3 

25 (nucleotides 1 to 675) and BVH-11 (nucleotides 1 to 684) 
genes respectively. The first 239 amino acids of the 
translated sequences of the GBS and GAS BVH-71 open reading 
frames are 51 and 54% identical to the first 225 and 228 
amino acids of BVH-3 and BVH-11, respectively. In addition 

30 to structural similarities, streptococcal BVH-3, BVH-11 and 
BVH-71 proteins also share antigenic epitopes. A 97-kDa 
band was revealed on Western blots of GAS or GBS whole 
cells, using Mab Hll-l.i-Gll reactive with the BVH-3 and 
BVH-11 conserved regions. Similarly, GAS and GBS 
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recombinant BVH-71 proteins were detected in Western 
immunoblot analysis. 

These results indicate that BVH-71, BVH-3 and BVH-11 
proteins might share similar functions. Our results also 
suggest that BVH-71 proteins can be used as protein vaccine 
components of anti - streptococcus . In a further embodiment 
BVH-71 proteins can be used as protein vaccine components 
of anti-GAS or anti-GBS vaccines. 
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What is claimed is: 

1. An isolated polynucleotide encoding a polypeptide having 
at least 70% identity to a second polypeptide having a 
sequence chosen from: SEQ ID NOs; 2, 4, 6, 8, 10, 14, 16, 
55 to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof . 

2. A polynucleotide according to claim 1, wherein said 
polynucleotide encodes a polypeptide having at least 95% 
identity to the second polypeptide. 

3 . An isolated polynucleotide encoding a polypeptide capable 
of generating antibodies having binding specificity for a 
polypeptide having a sequence chosen from: SEQ ID NOs: 2, 
4, 6, 8, 10, 14, 16, 55 to 75, 77 to 79, 81, 83 or 
fragments, analogs or derivatives thereof. 

4. An isolated polynucleotide that is complementary to the 
polynucleotide of claim 1. 

5. An isolated polynucleotide that is complementary to the 
polynucleotide of claim 3. 

6. The polynucleotide of claim 1, wherein said 
polynucleotide is DNA. 

7. The polynucleotide of claim 3, wherein said 
polynucleotide is DNA. 

8. The polynucleotide of claim 1, wherein said 
polynucleotide is RNA. 

9. The polynucleotide of claim 3, wherein said 
polynucleotide is RNA. 
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10. A vector comprising the polynucleotide of claim 1, 
wherein said DNA is operably linked to an expression 
control region. 

11. A vector comprising the polynucleotide of claim 3, 
wherein said DNA is operably linked to an expression 
control region. 

12. A host cell transfected with the vector of claim 10. 

13. A host cell transfected with the vector of claim 11. 

14. A process for producing a polypeptide comprising 
culturing a host cell according to claim 12 under 
conditions suitable for expression of said polypeptide. 

15. A process for producing a polypeptide comprising 
culturing a host cell according to claim 13 under 
condition suitable for expression of said polypeptide. 

16. An isolated polypeptide having at least 70% identity to a 
second polypeptide having an amino acid sequence chosen 
from: SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 77 
to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 

17 . An isolated polypeptide capable of generating antibodies 
having binding specificity for a second polypeptide 
having a sequence chosen from: SEQ ID NOs: 2, 4, 6, 8, 
10, 14, 16, 55 to 75, 77 to 79, 81, 83 or fragments, 
analogs or derivatives thereof. 

18. An isolated polypeptide having an amino acid sequence 

chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 to 75, 
77 to 79, 81, 83 or fragments, analogs or derivatives 
thereof . 
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19. An isolated polypeptide according to claim 18, wherein the 
N- terminal Met residue is deleted. 

20. An isolated polypeptide according to claim 18, wherein the 
secretory amino acid sequence is deleted. 

21. A chimeric polypeptide comprising two or more 
polypeptides chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 
16, 55 to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; provided that the polypeptides or 
fragments, analogs or derivatives thereof are linked as 
to form a chimeric polypeptide. 

22. A chimeric polypeptide comprising two or more polypeptides 

chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 69, 
72, 74, 77 or fragments, analogs or derivatives thereof; 
provided that the polypeptides or fragments, analogs or 
derivatives thereof are linked as to form a chimeric 
polypeptide. 

23. A chimeric polypeptide of formula (I): 
A-(B) B -(C). a -D (I) 

Wherein; 

m is 0 or 1, 

n is 0 or 1, 

A is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; 

B is chosen from SEQ ID NOs : 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof ; 

C is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof; and 

D is chosen from SEQ ID NOs: 2, 4, 6, 8, 10, 14, 16, 55 
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to 75, 77 to 79, 81, 83 or fragments, analogs or 
derivatives thereof . 

24. A chimeric polypeptide of formula (I): 
A-(B) B -(C),-D (I) 

Wherein; 

m is 0 or 1, 

n is 0 or 1, 

A is chosen from SEQ ID NOs tlO, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; 

B is chosen from SEQ ID NOs ;10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77, or fragments, analogs or derivatives 
thereof ; 

C is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof; and 

D is chosen from SEQ ID NOs :10, 58, 60, 62, 64, 67, 68, 

69, 72, 74, 77 or fragments, analogs or derivatives 
thereof . 

25. A vaccine composition comprising a polypeptide according 
to any one of claims 16 to 24 and a pharmaceutical ly 
acceptable carrier, diluent or adjuvant. 

26. A method for therapeutic or prophylactic treatment of 
meningitis, otitis media, bacteremia or pneumonia 
infection in an individual susceptible to meningitis, 
otitis media, bacteremia or pneumonia infection comprising 
administering to said individual a therapeutic or 
prophylactic amount of a composition according to claim 
25. 

27. A method for therapeutic or prophylactic treatment of 
streptococcal bacterial infection in an individual 
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susceptible to streptococcal infection comprising 
administering to said individual a therapeutic or 
prophylactic amount of a composition according to claim 
25. 

28. A method according to claim 26, wherein said individual is 
a mammal . 

29. A method according to claim 27, wherein said individual is 
a human. 

30. A method according to claim 22, wherein said bacterial 
infection is S. pneumoniae , group A streptococcus 
{pyogenes) , group B streptococcus (GBS or agalactiae) , 
dysgalactiae, uteris, nocardia or Staphylococcus aureus. 

31. A method according to claim 26, wherein said bacterial 
infection is S .pneumoniae . 

32. Use of a vaccine composition according to claim 25 for 
the prophylactic or therapeutic treatment of 
Streptococcal infection in an animal susceptible to or 
infected with streptococcal infection comprising 
administering to said animal a prophylactic or 
therapeutic amount of the composition. 
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ATGAAATTTA GTAAAAAATA TATAGCAGCT GGATCAGCTG TTATCGTATC CTTGAGTCTA 60 

TGTGCCTATG CACTAAACCA GCATCGTTCG CAGGAAAATA AGGACAATAA TCGTGTCTCT 120 

TATGTGGATG GCAGCCAGTC AAGTCAGAAA AGTGAAAACT TGACACCAGA CCAGGTTAGC 180 

CAGAAAGAAG GAATTCAGGC TGAGCAAATT GTAATCAAAA TTACAGATCA GGGCTATGTA 240 

ACGTCACACG GTGACCACTA TCATTACTAT AATGGGAAAG TTCCTTATGA TGCCCTCTTT 300 

AGTGAAGAAC TCTTGATGAA GGATCCAAAC TATCAACTTA AAGACGCTGA TATTGTCAAT 360 

GAAGTCAAGG GTGGTTATAT CATCAAGGTC GATGGAAAAT ATTATGTCTA CCTGAAAGAT 420 

GCAGCTCATG CTGATAATGT TCGAACTAAA GATGAAATCA ATCGTCAAAA ACAAGAACAT 4 80 

GTCAAAGATA ATGAGAAGGT TAACTCTAAT GTTGCTGTAG CAAGGTCTCA GGGACGATAT 54 0 

ACGACAAATG ATGGTTATGT CTTTAATCCA GCTGATATTA TCGAAGATAC GGGTAATGCT 600 

TATATCGTTC CTCATGGAGG TCACTATCAC TACATTCCCA AAAGCGATTT ATCTGCTAGT 660 

GAATTAGCAG CAGCTAAAGC ACATCTGGCT GGAAAAAATA TGCAACCGAG TCAGTTAAGC 720 

TATTCTTCAA CAGCTAGTGA CAATAACACG CAATCTGTAG CAAAAGGATC AACTAGCAAG 780 

CCAGCAAATA AATCTGAAAA TCTCCAGAGT CTTTTGAAGG AACTCTATGA TTCACCTAGC 84 0 

GCCCAACGTT ACAGTGAATC AGATGGCCTG GTCTTTGACC CTGCTAAGAT TATCAGTCGT 900 

ACACCAAATG GAGTTGCGAT TCCGCATGGC GACCATTACC ACTTTATTCC TTACAGCAAG 960 

CTTTCTGCTT TAGAAGAAAA GATTGCCAGA ATGGTGCCTA TCAGTGGAAC TGGTTCTACA 1020 

GTTTCTACAA ATGCAAAACC TAATGAAGTA GTGTCTAGTC TAGGCAGTCT TTCAAGCAAT 1080 

CCTTCTTCTT TAACGACAAG TAAGGAGCTC TCTTCAGCAT CTGATGGTTA TATTTTTAAT 114 0 

CCAAAAGATA TCGTTGAAGA AACGGCTACA GCTTATATTG TAAGACATGG TGATCATTTC 1200 

CATTACATTC CAAAATCAAA TCAAATTGGG CAACCGACTC TTCCAAACAA TAGTCTAGCA 1260 

ACACCTTCTC CATCTCTTCC AATCAATCCA GGAACTTCAC ATGAGAAACA TGAAGAAGAT 1320 

GGATACGGAT TTGATGCTAA TCGTATTATC GCTGAAGATG AATCAGGTTT TGTCATGAGT 1380 

CACGGAGACC ACAATCATTA TTTCTTCAAG AAGGACTTGA CAGAAGAGCA AATTAAGGCT 144 0 

GCGCAAAAAC ATTTAGAGGA AGTTAAAACT AGTCATAATG GATTAGATTC TTTGTCATCT 1500 

CATGAACAGG ATTATCCAGG TAATGCCAAA GAAATGAAAG ATTTAGATAA AAAAATCGAA 1560 

GAAAAAATTG CTGGCATTAT GAAACAATAT GGTGTCAAAC GTGAAAGTAT TGTCGTGAAT 1620 

AAAGAAAAAA ATGCGATTAT TTATCCGCAT GGAGATCACC ATCATGCAGA TCCGATTGAT 1680 

GAACATAAAC CGGTTGGAAT TGGTCATTCT CACAGTAACT ATGAACTGTT TAAACCCGAA 1740 

GAAGGAGTTG CTAAAAAAGA AGGGAATAAA GTTTATACTG GAGAAGAATT AACGAATGTT 1800 

GTTAATTTGT TAAAAAATAG TACGTTTAAT AATCAAAACT TTACTCTAGC CAATGGTCAA 1860 

AAACGCGTTT CTTTTAGTTT TCCGCCTGAA TTGGAGAAAA AATTAGGTAT CAATATGCTA 1920 

GTAAAATTAA TAACACCAGA TGGAAAAGTA TTGGAGAAAG TATCTGGTAA AGTATTTGGA 1980 

GAAGGAGTAG GGAATATTGC AAACTTTGAA TTAGATCAAC CTTATTTACC AGGACAAACA 2040 

TTTAAGTATA CTATCGCTTC AAAAGATTAT CCAGAAGTAA GTTATGATGG TACATTTACA 2100 

GTTCCAACCT CTTTAGCTTA CAAAATGGCC AGTCAAACGA TTTTCTATCC TTTCCATGCA 2160 

GGGGATACTT ATTTAAGAGT GAACCCTCAA TTTGCAGTGC CTAAAGGAAC TGATGCTTTA 2220 

GTCAGAGTGT TTGATGAATT TCATGGAAAT GCTTATTTAG AAAATAACTA TAAAGTTGGT 2280 

GAAATCAAAT TACCGATTCC GAAATTAAAC CAAGGAACAA CCAGAACGGC CGGAAATAAA 2340 

ATTCCTGTAA CCTTCATGGC AAATGCTTAT TTGGACAATC AATCGACTTA TATTGTGGAA 2400 

GTACCTATCT TGGAAAAAGA AAATCAAACT GATAAACCAA GTATTCTACC ACAATTTAAA 2460 

AGGAATAAAG CACAAGAAAA CTCAAAACTT GATGAAAAGG TAGAAGAACC AAAGACTAGT 2520 

GAGAAGGTAG AAAAAGAAAA ACTTTCTGAA ACTGGGAATA GTACTAGTAA TTCAACGTTA 2580 

GAAGAAGTTC CTACAGTGGA TCCTGTACAA GAAAAAGTAG CAAAATTTGC TGAAAGTTAT 2640 

GGGATGAAGC TAGAAAATGT CTTGTTTAAT ATGGACGGAA CAATTGAATT ATATTTACCA 2700 

TCAGGAGAAG TCATTAAAAA GAATATGGCA GATTTTACAG GAGAAGCACC TCAAGGAAAT 2760 

GGTGAAAATA AACCATCTGA AAATGGAAAA GTATCTACTG GAACAGTTGA GAACCAACCA 2820 

ACAGAAAATA AACCAGCAGA TTCTTTACCA GAGGCACCAA ACGAAAAACC TGTAAAACCA 2880 

GAAAACTCAA CGGATAATGG AATGTTGAAT CCAGAAGGGA ATGTGGGGAG TGACCCTATG 2 940 

TTAGATCCAG CATTAGAGGA AGCTCCAGCA GTAGATCCTG TACAAGAAAA ATTAGAAAAA 3000 

TTTACAGCTA GTTACGGATT AGGCTTAGAT AGTGTTATAT TCAATATGGA TGGAACGATT 3 060 

GAATTAAGAT TGCCAAGTGG AGAAGTGATA AAAAAGAATT TATCTGATTT CAT AG CGT AA 312 0 
(SEQ ID NO: 1) FIGURE 1 
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MKFSKKYIAA GSAVIVSLSL CAYALNQHRS QENKDNNRVS YVDGSQSSQK 50 

SENLTPDQVS QKEGIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF 100 

SEELLMKDPN YQLKDADIVN EVKGGYIIKV DGKYYVYLKD AAHADNVRTK 150 

DEINRQKQEH VKDNEKVNSN VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 200 

YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA GKKMQPSQLS YSSTASDNNT 250 

QSVAKGSTSK PANKSENLQS LLKELYDSPS AQRYSESDGL VFDPAKIISR 300 

TPNGVAI PHG DHYHFIPYSK LSALEEKIAR MVPISGTGST VSTNAKPNEV 350 

VSSLGSLSSN PSSLTTSKEL SSASDGYIFN PKDIVEETAT AYIVRHGDHF 400 
HYIPKSNQIG QPTLPNNSLA TPSPSLPINP GTSHEKHEED GYGFDANRII . 4 50 

AEDESGFVMS HGDHNHYFFK KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS 500 

HEQDYPGNAK EMKDLDKKIE EKIAGIMKQY GVKRESIWN KEKNAI I YPH 550 

GDHHHADPID EHKPVGIGHS HSNYELFKPE EGVAKKEGNK VYTGEELTNV 600 

VNLLKNSTFN NQNFTLANGQ KRVSFSFPPE LEKKLGINML VKLITPDGKV 650 

LEKVSGKVFG EGVGNIANFE LDQPYLPGQT FKYTIASKDY PEVSYDGTFT 700 

VPTSLAYKMA SQTIFYPFHA GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN 750 

AYLENNYKVG EIKLPIPKLN QGTTRTAGNK I PVTFMANAY LDNQSTYIVE 800 

VPILEKENQT DKPSILPQFK RNKAQENSKL DEKVEEPKTS EKVEKEKLSE 850 

TGNSTSNSTL EEVPTVDPVQ EKVAKFAESY GMKLENVLFN MDGTIELYLP 900 

SGEVIKKNMA DFTGEAPQGN GENKPSENGK VSTGTVENQP TENKPADSLP 950 

EAPNEKPVKP ENSTDNGMLN PEGNVGSDPM LDPALEEAPA VDPVQEKLEK 1000 
FTASYGLGLD SVIFNMDGTI ELRLPSGEVI KKNLSDFIA (S2Q ID NO: 2) 103 9 

FIGURE 2 
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ATGAAAATCA ATAAAAAATA TCTAGCTGGG TCAGTAGCTA CACTTGTTTT AAGTGTCTGT 60 

GCTTATGAAC TAGGTTTGCA TCAAGCTCAA ACTGTAAAAG AAAATAATCG TGTTTCCTAT 120 

ATAGATGGAA AACAAGCGAC GCAAAAAACG GAGAATTTGA CTCCTGATGA GGTTAGCAAG 18 0 

CGTGAAGGAA TCAACGCCGA ACAAATCGTC ATCAAGATTA CGGATCAAGG TTATGTGACC 24 0 

TCTCATGGAG ACCATTATCA TTACTATAAT GGCAAGGTCC CTTATGATGC CATCATCAGT 300 

GAAGAGCTCC TCATGAAAGA TCCGAATTAT CAGTTGAAGG ATTCAGACAT TGTCAATGAA 3 60 

ATCAAGGGTG GTTATGTCAT TAAGGTAAAC GGTAAATACT ATGTTTACCT TAAGGATGCA 42 0 

GCTCATGCGG ATAATGTCCG TACAAAAGAA GAAATCAATC GGCAAAAACA AGAACATAGT 4 80 

CAGCATCGTG AAGGAGGGAC TTCAGCAAAC GATGGTGCGG TAGCCTTTGC ACGTTCACAG 54 0 

GGACGCTACA CCACAGATGA TGGTTATATC TTCAATGCAT CTGATATCAT CGAAGATACG 600 

GGCGATGCCT ATATCGTTCC TCATGGAGAT CATTACCATT ACATTCCTAA GAATGAGTTA 660 

TCAGCTAGCG AGTTGGCTGC TGCAGAAGCC TTCCTATCTG GTCGGGAAAA TCTGTCAAAT 720 

TTAAGAACCT ATCGCCGACA AAATAGCGAT AACACTCCAA GAACAAACTG GGTACCTTCT 780 

GTAAGCAATC CAGGAACTAC AAATACTAAC ACAAGCAACA ACAGCAACAC TAACAGTCAA 84 0 

GCAAGTCAAA GTAATGACAT TGATAGTCTC TTGAAACAGC TCTACAAACT GCCTTTGAGT 900 

CAACGCCATG TAGAATCTGA TGGCCTTATT TTCGACCCAG CGCAAATCAC AAGTCGAACC 960 

GCCAGAGGTG TAGCTGTCCC TCATGGTAAC CATTACCACT TTATCCCTTA TGAACAAATG 1020 

TCTGAATTGG AAAAACGAAT TGCTCGTATT ATTCCCCTTC GTTATCGTTC AAACCATTGG 1080 

GTACCAGATT CAAGACCAGA AGAACCAAGT CCACAACCGA CTCCAGAACC TAGTCCAAGT 1140 

CCGCAACCTG CACCAAATCC TCAACCAGCT CCAAGCAATC CAATTGATGA GAAATTGGTC 1200 

AAAGAAGCTG TTCGAAAAGT AGGCGATGGT TATGTCTTTG AGGAGAATGG AGTTTCTCGT 1260 

TATATCCCAG CCAAGAATCT TTCAGCAGAA ACAGCAGCAG GCATTGATAG CAAACTGGCC 1320 

AAGCAGGAAA GTTTATCTCA TAAGCTAGGA GCTAAGAAAA CTGACCTCCC ATCTAGTGAT 1380 

CGAGAATTTT ACAATAAGGC TTATGACTTA CTAGCAAGAA TTCACCAAGA TTTACTTGAT 1440 

AATAAAGGTC GACAAGTTGA TTTTGAGGCT TTGGATAACC TGTTGGAACG ACTCAAGGAT 1500 

GTCTCAAGTG ATAAAGTCAA GTTAGTGGAT GATATTCTTG CCTTCTTAGC TCCGATTCGT 1560 

CATCCAGAAC GTTTAGGAAA ACCAAATGCG CAAATTACCT ACACTGATGA TGAGATTCAA 1620 

GTAGCCAAGT TGGCAGGCAA GTACACAACA GAAGACGGTT ATATCTTTGA TCCTCGTGAT 1680 

ATAACCAGTG ATGAGGGGGA TGCCTATGTA ACTCCACATA TGACCCATAG CCACTGGATT 174 0 

AAAAAAGATA GTTTGTCTGA AGCTGAGAGA GCGGCAGCCC AGGCTTATGC TAAAGAGAAA 1800 

GGTTTGACCC CTCCTTCGAC AGACCATCAG GATTCAGGAA ATACTGAGGC AAAAGGAGCA 1860 

GAAGCTATCT ACAACCGCGT GAAAGCAGCT AAGAAGGTGC CACTTGATCG TATGCCTTAC 1920 

AATCTTCAAT ATACTGTAGA AGTCAAAAAC GGTAGTTTAA TCATACCTCA TTATGACCAT 1980 

TACCATAACA TCAAATTTGA GTGGTTTGAC GAAGGCCTTT ATGAGGCACC TAAGGGGTAT 204 0 

ACTCTTGAGG ATCTTTTGGC GACTGTCAAG TACTATGTCG AACATCCAAA CGAACGTCCG 2100 

CATTCAGATA ATGGTTTTGG TAACGCTAGC GACCATGTTC AAAGAAACAA AAATGGTCAA 2160 

GCTGATACCA ATCAAACGGA AAAACCAAGC GAGGAGAAAC CTCAGACAGA AAAACCTGAG 2220 

GAAGAAACCC CTCGAGAAGA GAAACCACAA AGCGAGAAAC CAGAGTCTCC AAAACCAACA 2280 

GAGGAACCAG AAGAAGAATC ACCAGAGGAA TCAGAAGAAC CTCAGGTCGA GACTGAAAAG 2 340 

GTTGAAGAAA AACTGAGAGA GGCTGAAGAT TTACTTGGAA AAATCCAGGA TCCAATTATC 2400 

AAGTCCAATG CCAAAGAGAC TCTCACAGGA TTAAAAAATA ATTTACTATT TGGCACCCAG 2460 

GACAACAATA CTATTATGGC AGAAGCTGAA AAACTATTGG CTTTATTAAA GGAGAGTAAG 2520 

TAA (SEQ ID NO: 3) 2523 
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MKINKKYLAG SVATLVLSVC AYELGLHQAQ TVKENNRVSY IDGKQATQKT 50 

ENLTPDEVSK REGINAEQIV IKITDQGYVT SHGDHYHYYN GKVPYDAIIS 100 

EELLMKDPNY QLKDSDIVNE IKGGYVIKVN GKYYVYLKDA AHADNVRTKE 150 

EINRQKQEHS QHREGGTSAN DGAVAFARSQ GRYTTDDGYI FNASDIIEDT 200 

GDAYIVPHGD HYHYIPKNEL SASELAAAEA FLSGRENLSN LRTYRRQNSD 250 

NTPRTNWVPS VSNPGTTNTN TSNNSNTNSQ ASQSNDIDSL LKQLYKLPLS 300 

QRHVESDGLI FDPAQITSRT ARGVAVPHGN HYHFIPYEQM SELEKRIARI 350 

IPLRYRSNHW VPDSRPEEPS PQPTPEPSPS PQPAPNPQPA PSNPIDEKLV 400 

KEAVRKVGDG YVFEENGVSR YIPAKNLSAE TAAGIDSKLA KQESLSHKLG 450 

AKKTDLPSSD REFYNKAYDL LARIHQDLLD NKGRQVDFEA LDNLLERLKD 500 

VSSDKVKLVD DILAFLAPIR HPERLGKPNA QITYTDDEIQ VAKLAGKYTT 550 

EDGYIFDPRD ITSDEGDAYV TPHMTHSHWI KKDSLSEAER AAAQAYAKEK 600 

GLTPPSTDHQ DSGNTEAKGA EAIYNRVKAA KKVPLDRMPY NLQYTVEVKN 650 

GSLIIPHYDH YHNIKFEWFD EGLYEAPKGY TLEDLLATVK YYVEHPNERP 700 

HSDNGFGNAS DHVQRNKNGQ ADTNQTEKPS EEKPQTEKPE EETPREEKPQ 750 

SEKPESPKPT EEPEEESPEE SEEPQVETEK VEEKLREAED LLGKIQDPII 800 
KSNAKETLTG LKNNLLFGTQ DNNTIMAEAE KLLALLKESK (SEQ ID NO: 4) 84 0 

FIGURE 4 



ATGGAGAATA TAGACATGTT TAAATCAAAT CATGAGCGAA GAATGCGTTA TTCCATTCGT 60 

AAATTTAGTG TAGGAGTAGC TAGCGTAGCT GTTGCCAGTC TTTTTATGGG AAGTGTTGTA 120 

CATGCGACAG AGAAAGAGGG AAGTACCCAA GCAGCCACTT CTTTTAATAG GGGAAATGGA 180 

AGTCAGGCAG AACAACGTGG AGAACTCGAT TTAGAACGAG ATAAGGCAAT GAAAGCGGTC 240 

AGTGAATATG TAGGAAAAAT GGTGAGAGAT GCCTATGTAA AATCAGATAG AAAACGACAT 300 

AAAAATACTG TAGCTCTAGT TAACCAGTTG GGAAACATTA AGAACAGGTA TTTGAATGAA 360 

ATAGTTCATT CAACCTCAAA AAGCCAACTA CAGGAACTGA TGATGAAGAG TCAATCAGAA 420 

GTAGATGAAG CTGTGTCTAA ATTTGAAAAG GACTCATTTT CTTCGTCAAG TTCAGGATCC 480 

TCCACTAAAC CAGAAACTCC GCAGCCGGAA AATCCAGAGC ATCAAAAACC AACAACTCCA 540 

TCTCCGGATA CCAAACCAAG CCCTCAACCA GAAGGCAAGA AACCAAGCGT ACCAGACATT 600 

AATCAGGAAA AAGAAAAAGC TAAGCTTGCT GTAGTAACCT ACATGAGCAA GATTTTAGAT 660 

GATATACAAA AACATCATCT GCAGAAAGAA AAACATCGTC AGATTGTTGC TCTTATTAAG 72 0 

GAGCTTGATG AGCTTAAAAA GCAAGCTCTT TCTGAAATTG ATAATGTAAA TACCAAAGTA 780 

GAAATTGAAA ATACAGTCCA CAAGATATTT GCAGACATGG ATGCAGTTGT GACTAAATTC 840 

AAAAAAGGCT TAACTCAGGA CACACCAAAA GAACCAGGTA ACAAAAAACC ATCTGCTCCA 900 

AAACCAGGTA TGCAACCAAG TCCTCAACCA GAGGTTAAAC CGCAGCTGGA AAAACCAAAA 960 

CCAGAGGTTA AACCGCAACC AGAAAAACCA AAACCAGAGG TTAAACCGCA GCCGGAAAAA 1020 

CCAAAACCAG AGGTTAAACC GCAGCCGGAA AAACCAAAAC CAGAGGTTAA ACCGCAGCCG 1080 

GAAAAACCAA AACCAGAGGT TAAACCGCAG CCGGAAAAAC CAAAACCAGA GGTTAAACCG 1140 

CAGCCGGAAA AACCAAAACC AGAGGTTAAA CCGCAGCCGG AAAAACCAAA ACCAGAGGTT 1200 

AAACCGCAGC CGGAAAAACC AAAACCAGAG GTTAAACCGC AGCCGGAAAA ACCAAAACCA 1260 

GAGGTTAAAC CGCAGCCGGA AAAACCAAAA CCAGAGGTTA AACCGCAACC AGAAAAACCA 1320 

AAACCAGAGG TTAAACCGCA ACCAGAAAAA CCAAAACCAG ATAATAGCAA GCCACAAGCA 1380 

GATGATAAGA AGCCATCAAC TACAAATAAT TTAAGCAAGG ACAAGCAACC TTCTAACCAA 144 0 

GCTTCAACAA ACGAAAAAGC AACAAATAAA CCGAAGAAGT CATTGCCATC AACTGGATCT 1500 

ATTTCAAATC TAGCACTTGA AATTGCAGGT CTTCTTACCT TGGCGGGGGC AACCATTCTT 1560 

GCTAAGAA^A GAATGAAATA G (SEQ ID NO: 5) 15 81 
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MENIDMFKSN HERRMRYSIR KFSVGVASVA VASLFMGSW HATEKEGSTQ 50 

AATSFNRGNG SQAEQRGELD LERDKAMKAV SEYVGKMVRD AYVKSDRKRH 100 

KNTVALVNQL GNIKNRYLNE IVHSTSKSQL QELMMKSQSE VDEAVSKFEK 150 

DSFSSSSSGS STKPETPQPE NPEHQKPTTP SPDTKPSPQP EGKKPSVPDI 200 

NQEKEKAKLA WTYMSKILD DIQKHHLQKE KHRQIVALIK ELDELKKQAL 250 

SEIDNVNTKV EIENTVHKIF ADMDAWTKF KKGLTQDTPK EPGNKKPSAP 300 

KPGMQPSPQP EVKPQLEKPK PEVKPQPEKP KPEVKPQPEK PKPEVKPQPE 350 

KPKPEVKPQP EKPKPEVKPQ PEKPKPEVKP QPEKPKPEVK PQPEKPKPEV 400 

KPQPEKPKPE VKPQPEKPKP EVKPQPEKPK PEVKPQPEKP KPEVKPQPEK 450 

PKPDNSKPQA DDKKPSTTNN LSKDKQPSNQ ASTNEKATNK PKKSLPSTGS 500 

ISNLALEIAG LLTLAGATIL AKKRMK (SEQ ID NO: 6) 526 

FIGURE 6 



ATGAAATTTA GTAAAAAATA TATAGCAGCT GGATCAGCTG TTATCGTATC CTTGAGTCTA 60 

TGTGCCTATG CACTAAACCA GCATCGTTCG CAGGAAAATA AGGACAATAA TCGTGTCTCT 120 

TATGTGGATG GCAGCCAGTC AAGTCAGAAA AGTGAAAACT TGACACCAGA CCAGGTTAGC 180 

CAGAAAGAAG GAATTCAGGC TGAGCAAATT GTAATCAAAA TTACAGATCA GGGCTATGTA 24 0 

ACGTCACACG GTGACCACTA TCATTACTAT AATGGGAAAG TTCCTTATGA TGCCCTCTTT 300 

AGTGAAGAAC TCTTGATGAA GGATCCAAAC TATCAACTTA AAGACGCTGA TATTGTCAAT 360 

GAAGTCAAGG GTGGTTATAT CATCAAGGTC GATGGAAAAT ATTATGTCTA CCTGAAAGAT 420 

GCAGCTCATG CTGATAATGT TCGAACTAAA GATGAAATCA ATCGTCAAAA ACAAGAACAT 480 

GTCAAAGATA ATGAGAAGGT TAACTCTAAT GTTGCTGTAG CAAGGTCTCA GGGACGATAT 540 

ACGACAAATG ATGGTTATGT CTTTAATCCA GCTGATATTA TCGAAGATAC GGGTAATGCT 600 

TATATCGTTC CTCATGGAGG TCACTATCAC TACATTCCCA AAAGCGATTT ATCTGCTAGT 660 

GAATTAGCAG CAGCTAAAGC ACATCTGGCT GGAAAAAATA TGCAACCGAG TCAGTTAAGC 720 

TATTCTTCAA CAGCTAGTGA CAATAACACG CAATCTGTAG CAAAAGGATC AACTAGCAAG 780 

CCAGCAAATA AATCTGAAAA TCTCCAGAGT CTTTTGAAGG AACTCTATGA TTCACCTAGC 84 0 

GCCCAACGTT ACAGTGAATC AGATGGCCTG GTCTTTGACC CTGCTAAGAT TATCAGTCGT 900 

ACACCAAATG GAGTTGCGAT TCCGCATGGC GACCATTACC ACTTTATTCC TTACAGCAAG 960 

CTTTCTGCTT TAGAAGAAAA GATTGCCAGA ATGGTGCCTA TCAGTGGAAC TGGTTCTACA 1020 

GTTTCTACAA ATGCAAAACC TAATGAAGTA GTGTCTAGTC TAGGCAGTCT TTCAAGCAAT 1080 

CCTTCTTCTT TAACGACAAG TAAGGAGCTC TCTTCAGCAT CTGATGGTTA TATTTTTAAT 1140 

CCAAAAGATA TCGTTGAAGA AACGGCTACA GCTTATATTG TAAGACATGG TGATCATTTC 1200 

CATTACATTC CAAAATCAAA TCAAATTGGG CAACCGACTC TTCCAAACAA TAGTCTAGCA 1260 

ACACCTTCTC CATCTCTTCC AATCAATCCA GGAACTTCAC ATGAGAAACA TGAAGAAGAT 1320 

GGATACGGAT TTGATGCTAA TCGTATTATC GCTGAAGATG AATCAGGTTT TGTCATGAGT 1380 

CACGGAGACC ACAATCATTA TTTCTTCAAG AAGGACTTGA CAGAAGAGCA AATTAAGGTG 144 0 

CGCAAAAACA TTTAG (SEQ ID NOt 7) 1455 

FIGURE 7 
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MKFSKKYIAA GSAVIVSLSL CAYALNQHRS 
SENLTPDQVS QKEGIQAEQI VIKITDQGYV 
SEELLMKDPN YQLKDADIVN EVKGGYIIKV 
DEINRQKQEH VKDNEKVNSN VAVARSQGRY 
YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA 
QSVAKGSTSK PANKSENLQS LLKELYDSPS 
TPNGVAIPHG DHYHFIPYSK LSALEEKIAR 
VSSLGSLSSN PSSLTTSKEL SSASDGYIFN 
HYIPKSNQIG QPTLPNNSLA TPSPSLPINP 
AEDESGFVMS HGDHNHYFFK KDLTEEQIKV 



QENKDNNRVS YVDGSQSSQK 50 

TSHGDHYHYY NGKVPYDALF 100 

DGKYYVYLKD AAHADNVRTK 150 

TTNDGYVFNP ADIIEDTGNA 200 

GKNMQPSQLS YSSTASDNNT 250 

AQRYSESDGL VFDPAKIISR 300 

MVPISGTGST VSTNAKPNEV 350 

PKDIVEETAT AYIVRHGDHF 400 

GTSHEKHEED GYGFDANRII 450 

RKNI (SEQ ID NO: 8) 464 

FIGURE 8 



ATGAAAGATT TAGATAAAAA AATCGAAGAA AAAATTGCTG GCATTATGAA ACAATATGGT 
GTCAAACGTG AAAGTATTGT CGTGAATAAA GAAAAAAATG CGATTATTTA TCCGCATGGA 
GATCACCATC ATGCAGATCC GATTGATGAA CATAAACCGG TTGGAATTGG TCATTCTCAC 
AGTAACTATG AACTGTTTAA ACCCGAAGAA GGAGTTGCTA AAAAAGAAGG GAATAAAGTT 
TATAC TGGA G AAGAATTAAC GAATGTTGTT AATTTGTTAA AAAATAGTAC GTTTAATAAT 

CAAAACTTTA CTCTAGCCAA TGGTCAAAAA CGCGTTTCTT TTAGTTTTCC GCCTGAATTG JDU 

GAGAAAAAAT TAGGTATCAA TATGCTAGTA AAATTAATAA CACCAGATGG AAAAGTATTG 420 

GAGAAAGTAT CTGGTAAAGT ATTTGGAGAA GGAGTAGGGA ATATTGCAAA CTTTGAATTA 480 

GATCAACCTT ATTTACCAGG ACAAACATTT AAGTATACTA TCGCTTCAAA AGATTATCCA 540 

GAAGTAAGTT ATGATGGTAC ATTTACAGTT CCAACCTCTT TAGCTTACAA AATGGCCAGT 600 

CAAACGATTT TCTATCCTTT CCATGCAGGG GATACTTATT TAAGAGTGAA CCCTCAATTT 660 

GCAGTGCCTA AAGGAACTGA TGCTTTAGTC AGAGTGTTTG ATGAATTTCA TGGAAATGCT 720 

TATTTAGAAA ATAACTATAA AGTTGGTGAA ATCAAATTAC CGATTCCGAA ATTAAACCAA 780 

GGAACAACCA GAACGGCCGG AAATAAAATT CCTGTAACCT TCATGGCAAA TGCTTATTTG 840 

GACAATCAAT CGACTTATAT TGTGGAAGTA CCTATCTTGG AAAAAGAAAA TCAAACTGAT 900 

AAACCAAGTA TTCTACCACA ATTTAAAAGG AATAAAGCAC AAGAAAACTC AAAACTTGAT 960 

GAAAAGGTAG AAGAACCAAA GACTAGTGAG AAGGTAGAAA AAGAAAAACT TTCTGAAACT 1020 

GGGAATAGTA CTAGTAATTC AACGTTAGAA GAAGTTCCTA CAGTGGATCC TGTACAAGAA 1080 

AAAGTAGCAA AATTTGCTGA AAGTTATGGG ATGAAGCTAG AAAATGTCTT GTTTAATATG 114 0 

GACGGAACAA TTGAATTATA TTTACCATCA GGAGAAGTCA TTAAAAAGAA TATGGCAGAT 1200 

TTTACAGGAG AAGCACCTCA AGGAAATGGT GAAAATAAAC CATCTGAAAA TGGAAAAGTA 1260 

TCTACTGGAA CAGTTGAGAA CCAACCAACA GAAAATAAAC CAGCAGATTC TTTACCAGAG 1320 

GCACCAAACG AAAAACCTGT AAAACCAGAA AACTCAACGG ATAATGGAAT GTTGAATCCA 1380 
GAAGGGAATG TGGGGAGTGA CCCTATGTTA GATCCAGCAT TAGAGGAAGC TCCAGCAGTA 
GATCCTGTAC AAGAAAAATT AGAAAAATTT ACAGCTAGTT ACGGATTAGG CTTAGATAGT 
GTTATATTCA ATATGGATGG AACGATTGAA TTAAGATTGC CAAGTGGAGA AGTGATAAAA 
AAGAATTTAT CTGATTTCAT AGCGTAA (SEQ ID NO: 9) 



60 
120 
180 
240 
300 
360 



1440 
1500 
1560 
1587 
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MKDLDKKIEE KIAGIMKQYG 
HKPVGIGHSH SNYELFKPEE 
QNFTLANGQK RVSFSFPPEL 
GVGNIANFEL DQPYLPGQTF 
QTIFYPFHAG DTYLRVNPQF 
IKLPIPKLNQ GTTRTAGNKI 
KPSILPQFKR NKAQENSKLD 
EVPTVDPVQE KVAKFAESYG 
FTGEAPQGNG ENKPSENGKV 
NSTDNGMLNP EGNVGSDPML 
VIFNMDGTIE LRLPSGEVIK 



VKRESIWNK EKNAIIYPHG 
GVAKKEGNKV YTGEELTNW 
EKKLGINMLV KLITPDGKVL 
KYTIASKDYP EVSYDGTFTV 
AVPKGTDALV RVFDEFHGNA 
PVTFMANAYL DNQSTYIVEV 
EKVEEPKTSE KVEKEKLSET 
MKLENVLFNM DGTIELYLPS 
STGTVENQPT ENKPADSLPE 
DPALEEAPAV DPVQEKLEKF 
KNLSDFIA (SEQ ID NO : 

FIGURE 10 



DHHHADPIDE 50 

NLLKNSTFNN 100 

EKVSGKVFGE 150 

PTSLAYKMAS 200 

YLENNYKVGE 250 

PILEKENQTD 300 

GNSTSNSTLE 350 

GEVIKKNMAD 400 

APNEKPVKPE 450 

TASYGLGLDS 500 

10) 528 
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BVH3 


WU2 


1 


CAYALNQHRSQENKDNNRVSYVDGSOSSQKSENLTPDQVSOKEGIOAEQIVIKITDQGYV 


60 


BVH3 


RX1 


1 


CAYALNQHRSQENICDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 


60 


BVH3 JNR7/87 


1 


CAYAUfQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 


60 


BVH3 


SP64 


1 


CAYALNQHRSQENKDNNRVSYVDGSQSSQKSEMLTPDQVSOKEGIOAEQIVIKITDQGYV 


60 


BVH3 


P4241 


1 


CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 


60 


BVH3 


A66 


1 


CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIOAEQIVIKITDQGYV 


60 


BVH3 


WU2 


61 


TSHGDHYHYYNGKVPYDALFSEELU4KDPNYQLKDADIVNEVKGGYIIKVTC 


120 


BVH3 


RX1 


61 


TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYI I KVDGKYYVYLKD 


120 


BVH3 JNR7/87 


61 


TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNBVKGGYI I KVDGKYYVYLKD 


120 


BVH3 


SP64 


61 


TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDAD I VNEVKGGYI I KVDGKYYVYLKD 


120 


BVH3 


P4241 


61 


TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYI I KVDGKYYVYLKD 


120 


BVH3 


A66 


61 


TSHGDHYHYYNGKVPYDALFSEEL1>1KDPNYQLKDADIVNEVICGGYIIKVDGKYYWLK^ 


120 


BVH3 


WU2 


121 


AAHAD^TKDEI>mQKOEHVKI)NEKVNSNVAVARSQGRYTTNDCYVra 


160 


BVH3 


RX1 


121 


AAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNA 


180 


BVH3 JNR7/87 


121 


AAHADNVRTKDEINRQKQEHVKDNEKVNSNVA VARSQGRYTTNDGYVFNPADI I EDTGNA 


160 


BVH3 


SP64 


121 


AAHADNVRTKDEINRQKQEHV^NEKVNSNVAVARSQGRYTTNWYVFNPADIIEDTGNA 


180 


BVH3 


P4241 


121 


AAHADWVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTTNDGYVFKPADI I EDTGNA 


160 


BVK3 


A66 


121 


AAHADNVRTKDEINRQKQEHVKDNEKVNSNVAVARSQGRYTnroGYVFNPAD 1 1 EDTGNA 


180 


BVH3 


WU2 


161 


YIVPHRGHYHYIPKSDLSASELAAAKAHIAGKNMQPSQ 


240 


BVH3 


RX1 


181 


YI VPHGGHYHYI PKSDLS AS ELAAAKAHLAGKNMQPSQLS YS STASDNNTQSVAKG STS K 


240 


BVH3 JNR7/87 


181 


YIVPHGGHYHYIPKSDLSASELAAAKAHIAGKNMQPSQI^YSSTASDNNTQSVAKGSTSK 


240 


BVH3 


SP64 


181 


YIVPHGGHYHYIPKSDLSASEIAAAKAHUGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 


240 


BVH3 


P4241 


181 


YIVPHRGHYHYIPKSDLSASELAAAKAHIiAGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 


240 


BVH3 


A66 


181 


YIVPHRGHYHYIPKSDLSASELAAAKAHLAGKNMQPSQLSYSSTASDNNTQSVAKGSTSK 


240 


BVH3 


WU2 


241 


PANKSENLQSLLKELYDS PSAQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHF I PYSK 


300 


BVH3 


RX1 


241 


PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSK 


300 


BVH3 JNR7/87 


241 


PANKSENLQSLLKELYDS PSAQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHF I PYSK 


300 


BVH3 


SP64 


241 


PANKSENLQSLLKELYDS PSAQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHFI PYSK 


300 


BVH3 


P4241 


241 


PANKSENLQSLLKELYDS PSAQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHF I PYSK 


300 


BVH3 


A66 


241 


PANKSENLQSLLKELYDS PSAQRYSESDGLVFDPAKI I SRTPNGVAI PHGDHYHFI PYSK 


300 


BVH3 


WU2 


301 LSALEEKIARMVPISGTGSTVSTNAKPMEWSSLGSLSSNPSSLTTSKELSSASDGYIFN 


360 


BVH3 


RX1 


3 01 LS ALEEKI ARRVP I SGTG STVSTNAKPNEWS S LGS LSSN PSS LTTS KELS SASDGY I FN 


360 


BVH3 JKR7/87 


301 1^ ALEEKI ARMVPISGTGSTVSTh^PNEWSSLGSLSSNPSSLTTSKELSSASDGYI FN 


360 


BVH3 


SP64 


301 LS ALEEKI ARMVP I SGTGSTVSTNAKPKEVVSSLGSLSSN PSS LTTS KELS SASDGY I FN 


360 


BVH3 


P4241 


301 


LSALEEKIARMVPISGTGSTVSTNAKPNEWSSLGSLSSNPSSLTTSKELSSASDGYIFN 


360 


BVH3 


A66 


301 


LSALEEKIARMVPISGTGSTVSTNAKPNEWSSLGSLSSNPSSLTTSKELSSASDGYIFN 


360 


BVH3 


WU2 


361 


PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 


420 


BVH3 


RX1 


361 


PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSIATPSPSLPINPGISHEKHEED 


420 


BVH3 JNR7/87 


361 


PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 


420 


BVH3 


SP64 


361 


PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 


420 


BVH3 


P4241 


361 


PKD I VEETAT A YI VRHGDH FHY I PKS NQ IGQ PTLPNNS LAT PS PSLP INPGTSHEKHEED 


420 


BVH3 


A66 


361 


PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSIATPSPSLPINPGTSHEKKEED 


420 


BVH3 


WU2 


421 


GYGFDANRIIAEDESGFWSHGDHNHYFFKXDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 


480 


BVH3 


RX1 


421 


GYGFDANRI IAEDESGFIMSHGNHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 


480 


BVH3 JNR7/87 


421 GYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 


480 


BVH3 


SP64 


421 


GYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 


480 


BVH3 


P4241 


421 


GYGFDANRI I AED ESG FVMS HGDHNH Y F F KKD LTE EQ I KAAQKHLEEVKTSHNGLDSLSS 


480 


BVH3 


A66 ' 


421 


GYGFDANRI I AEDESGFVMS HGDHNH YFFKKDLTEEQI KAAQKHLEEVKTSHNGLDSLSS 


480 
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BVH3 WU2 481 HBQDYPSNAKBMKDLDKXXEEKIACINXQYGVKRESIWNKEKNAIXYPHGDHHHADPID S4 0 

BVH3 RX1 481 HEQDYPGNAKEHKDLDKKIEEKIAGIMKQYGVKRESIV\WKEKNAIIYPHGDHHHADPID S4 0 

BVH3 JNR7/87 481 HEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIWNKEKNAIIYPHGDHHHADPID 54 0 

BVH3 SP64 4 81 H EQDYPGNAKEMXDLDKKI EE K I AG IMKQYG VKRES I WNKEKNAI I Y PHGOHHHAD P I D 54 0 

BVH3 P4241 4 81 HEQDYPSNAKEMKDLDKKI EEK I AG I MKQYGVKRES I WNKEKNAI I YPHGDHHHADP I D 540 

BVH3 A66 481 HEQDYPSNAKEMKDLDKKI EEKIAGIMKQYGVKRES I WNKEKNAI 1 YPHGDHHHADP ID 540 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



541 EHKPVGIGHSHSNYELFKPEEGVAXKEGNK\nTO 600 

541 EHK P VG I GHSHSNYELF JCP EEGVAKKEGNKVYTG E ELTNVVNLLKNST FNNONFTLANGQ 600 

541 EHKPVGICTSHSNYBIJICPEEGVAKICEGNKVYTGEEL^ 60 o 

541 EHKPVGIGHSHSNYEXFJCPEEGVAKKEGincVYTGEELTN^ 60 0 

S41 EHKPVG IGHSHSNYELFKPEEGVAKKEGNKVYTG EELTNVVNLLKNSTFNNQNFTLANGQ 600 

541 EHICPVGIGHSHSNYELFKPEEGVAKKEGNTCVYTGEELTNVV^ 600 



BVH3 WU2 
BVK3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



601 lOlVSFSFPPELEKKMlNMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQP 660 

601 KRVSPSFPPEI^KKLGINMLVKLITPDGKVI^KVSGKVFGEGV^ 660 

601 lOWSFSFPPELEKKIjGINMLVKLITPDGKVI^KVSGKVPGEG 660 

601 KRVSFSFPPELEKKLGXNMLVKXiXTPDGKVLEKVSGKVFGEGVG^ 660 

601 KRVSFSFPPEI*EKKXX3INMLV1CLITPDGKVIXKVSGKVFGEGVGNIANF 660 

601 KRVSFSFPPEI£KKIX5INMLVia*ITPDGlCVI^F^^ 660 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 5P64 
BVH3 P4241 
BVH3 A66 



661 FKYTIASKDYPEVSYDGTFTVPTSIAYKMASQTI FYPFHAGDTYI*RVNPQFAVPKGTDAL 720 

661 FKYTIASKDYPETVSYDGTFTVPTSLAYKMASQTiro^ 720 

661 FKYTIASKDYPEVSYDGTFTVirrSLAYKMASGTIFYPFTlAGDTYLRVNPQFAV 72 0 

661 FKYTIASKDYPEVSYDGTFTVPTSIAYKMASQTI FYPFHAGDTYLRVNPQFAVPKGTDAL 720 

661 FKYTXASKDYPEVSYDGTFTVPTSLAYKMASQTI FYPFHAGDTYLRVNPQFAVPKGTDAL 720 

661 FICrriASKDYPEVSYDGTFTVPTSLAYKMASQTI FYPFTiAGDTYLRVNPQFAVPKGTDAL 720 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4 241 
BVH3 A66 



721 VRVFDEFHGNAYI*ENNYKVGE X KLPI PKLNQGTTRTAGNKI PVTFMANAYLDNQSTYIVE 780 

721 VRVFDE FHGNAYLENNY KVGE I KLPI PKLNQGTTRTAGNKI PVTFMANAYLDNQSTYIVE 780 

721 VRVFDEFHGNAYLENNYXVGEI KLP I PKLNQGTTRTAGNKI PVTFMANAYLDNQSTYIVE 780 

721 VRVFDEFKGNAYLENNYXVGEIKLPIPKI^QGTTRTAGNKIPVTFMANAYIJDNQS^ 780 

721 VR VFD EFHGNAYLENNYKVG E I KL P I P KLNQGTTRTAGNK I PVT FMANA YLDNQSTY I VE 780 

721 VRVFDEFHGNAYLENNYKVGEIKLPI PKIjNQGTTRTAGNKI PVTFMANAYLDNQSTYIVE 780 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



781 VPILEKENGTDKPSILPQFKRNKAQENSKFDEKVEEPKTSEKVEKEKLS 8 40 

781 VP X LEKENQTDKPS I LPQFKRNKAQENS IQLDEKVEEPKTSEKVEKEKLSETGNSTSNSTL 840 

781 VTILEKENQ/TOKPSILPQFKRNKAQENIJUjDEKVEEPKTSEKVEKE 840 

781 VP X LEKENQTDKPS X I*PQFKRNXAQENSK1^EKVZE PKTSEKVEKEKLSETGNSTSNSTL 840 

781 VP I LEKENQTDKPSI LPQFKRNKAQENS KFDEKVEE PKTSEKVEKEKLSETGNSTSNSTL 840 

781 VPILEKENQTDKPSILPQFKRNKAQENSKFDEKVTEPKTSEKVEKEia^ETC 840 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



841 EEVPTWPVQEICVAKFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMAIJFTGEAPQGN 900 

841 EEVPTVDPVQEKVAKFAESYGMKLENVLFNMIXnT ELYLPSGEVI KKNMADFTCE 900 

841 EEVPTVDPVQEICVAJCFAESYGMKLENVLFNMDGTIELYLPSGEVIKKNMADFTC 900 

841 EEVPTVDPVQEKVAKFAESYGMKLENVL FNMDGT I ELYLPSGEVI KKNMADFTGEAPQGN 900 

841 EEVPTVDPVQEKVAKFAESYGMXLENVLFNMDGTIELYLPSGF/VI 900 

841 EEVPTVDPVQEKVAXFAESYGMKLENVLFNMDGT I ELYLPSGEVI KKNMADFTGEAPQGN 900 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
BVH3 A66 



901 GENXPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVXPENSTDNGMLNPEGNVGSDPM 960 

901 GENKPSF^GKVSTGTVENQPTENXPADSLPEAPN^KPVKPENSTDNGMLNPEGNVGSDPM 960 

901 G ENXPSENGKV STGTV ENQ PTENKPADS L PEAPNEKPVKP ENSTDNGMLN PEGNVGSD PM 960 

901 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 960 

901 GENKPSENGKVSTGTVENQPTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPM 960 

901 G ENKPSENGKVSTGTVENQ PTENKPADS LPEAPNEKPVKPENSTONGMLNPEGNVGSDPM 960 



BVH3 WU2 
BVH3 RX1 
BVH3 JNR7/87 
BVH3 SP64 
BVH3 P4241 
SVH3 A66 



961 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVI FNMDGTI ELRLPSGEVIKKNLSDLI A 1019 

961 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVI FNMDGTI ELRLPSGEVIKKNLSDLI A 1019 

961 LD P ALEEAPA VD PVQEKLEKFT AS YG LG LDSV I FNMDGT X ELRLP SGEV I KKNLSDLI A 1019 

961 LD P ALEEAPA VDPVQEKLEKFT AS YGLG LDS V I FNMDGT X ELRLPSGEV I KKNLSDFX A 1019 

961 LDPALEEAPAVDPVQEKLEKFTASYGLGLDSVI FNMDGTI ELRLPSGEVIKKNLSDLI A 1019 

961 LD PALEEAPA VDPVQEKLEKFT AS YGLG LDSVI FNMDGT I ELRLPSGEVI KKNLSDLI A 1019 
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BVHll-2 SP64 1 CS YE LGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVHll-2 JNR7/87 1 CSYBI/jRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQI VI KITDQGY 60 

BVHll-2 P4241 1 CSYEUiRHOftaODKKESKRVAYIDGOQAGQKAENLTPDEVSKREGINAEQIVIKITDQGy 60 

BVHll-2 A66 1 CSYELGRHQACQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDOGY 60 

BVHll-2 WU2 1 CSYBLGRHQAGQDKKESNRVAYI DGDQAGQKAENLTPDEVSKREG INAEQI VI KITDQGY 60 

BVHll-2 Rxl 1 CSYELGRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11 P4241 1 CSYEU5RHQAGQDKKESNRVAYI DGDQAGQKAENLTPDEVSKREG INAEQI VI KITDQGY 60 

BVH11 WU2 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVI KITDQGY 60 

BVH11 A66 1 CSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREG INAEQI VI KITDQGY 60 

BVH11 Rxl 1 CSYEWRHQAGQVKKESNRVSYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQGY 60 

BVH11 JNR7/67 1 CSYEI/3RHQAGQDKKBSNRVAY I DGDQAGQKAENLTPDEVSKREG INAEQI VI KITDQGY 60 

BVH11 SP63 1 CSY£UjRHQAGQVKKE5HRVSYIDGDQAGQKAENLTPDEVSKREGINAEQI VI KITDQGY 60 

BVK11 SP64 1 CAYELGLHQA-QTVKENNRVSYIDGKQATQKTENLTPDEVSKREGINAEQIVIKITDQGY 59 



BVHll-2 SP64 61 VTSHGDHYHYYKGKVPYD AI I S EE LLMKD PNYQLKDSD I VKE I KGG YVI KVDGKYYVYLK 120 

BVHll-2 JNR7/87 61 VTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSDIVNE I KGGYVI KVDGKYYVYLK 120 

BVHll-2 P4241 61 VTSHGDHYHYYNGKVPYDAI ISEEUuMKIJPWYQLKDSD I miKCW YVI KVNGKYYVYLK 120 

BVHll-2 A66 61 VTSHGDHYHYYNGKVPYDAI ISEEUilKDPNYQUCDSD I VNE I KGG WI KVNGKYYVYLK 120 

BVHll-2 WU2 61 VTSHGDHYHYYNGKVPYDAI ISEELLMKDPNYQLKDSDI\^IKGGWI KVNGKYYVYLK 120 

BVHll-2 Rxl 61 VTSHGDHYHYYNGKVPYDAI ISEEI^KDPNYQLKDSDIVNEIKGGYVIKVDGKY^ 120 

BVH11 P4241 61 VTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSDIVNE I KGGYVI KVNGKYYVYLK 120 

BVH11 WU2 61 VTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSDIVNEI KGGYVI KVNGKYYGYLK 120 

BVH11 A66 61 VTSHGDHYHYYNGKVPYDAI I SEELIilKDPNYQLKDSD I VNE I KGGYVI KVNGKYYVYLK 120 

BVH11 Rxl 61 VTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSDIVNE I KGGYVI KVDGKYYVYLK 120 

BVH11 JNR7/87 61 VTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSDIVNE I KGG YVI KVNGKYYVYLK 120 

BVH11 SP63 61 VTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSDIVNE I KGGYVI KVDGKYYVYLK 120 

BVH11 SP64 60 VTSHGDHYHYYNGKVPYDAI I SEELLMKDPNYQLKDSDIVNE I KGG YVI KVNGKYYVYLK 119 



BVHll-2 SP64 121 DAAHADNIRTKEEIKRQKQEHSKNHNSRA DNAVAAARAQGRYTTDDGYI FNASDI I E 177 

BVHll-2 JNR7/87 121 DAAKADN I RTKEEI KRQKQBHSKNHGGGSN - - DQAWAARAQGRYTTDDGYI FNASDI I E 178 

BVHll-2 P4241 121 DAAHADN I RTKEE I KRQKQEHSHNHGGGSN - - DQA WAARAQGR YTTDDG Y I FNASD HE 178 

BVHll-2 A66 121 DAAHADNI RTKEEIKRQRQEHSHNHGGGSN - - DQAWAARAQGRYTTDDGYI FNASD HE 178 

BVHll-2 WU2 121 DAAHADN I RTKEE I KRQ KQEHSHNHGGG SN - - DQA W AARAQGRYTTDDG Y I FN AS DUE 178 

BVHll-2 Rxl 121 DAAHADN I RTKEE I KRQKQERSHNHNS RA DNAVAAARAQGRYTTDDGYI FNASD HE 177 

BVH11 P4241 121 DAAHADNI RTKEE I KRQKQEHSHNHGGGSN - - DQAWAARAQGRYTTDDGYI FNASDI I E 178 

BVH11 W02 121 DAAHADNI RTKEE I KRQKQEHSHNHGGGSN - - DQAWAARAQGRYTTDDGYI FNASD HE 178 

BVH11 A66 121 DAAHADNI RTKEE I KRQKQEHSHNHGGGSN - - DQAWAARAQGRYTTDDGYI FNASD 1 1 E 178 

BVH11 Rxl 121 DAAHADNI RTKEE I KRQKQERSKNKNSRA DNAVAAARAQGRYTTDDGYI FNASD HE 177 

BVH11 JNR7/87 121 DAAHADNI RTKEE I KRQKQERSHNHNSRA * - - DNAVAAARAQGRYTTDDGYI FNASD 1 1 E 177 

BVH11 SP63 121 DAAHADNI RTKEE I KRQKQERSHNHNSRA DNAVAAARAQGRYTTDDGYI FNASD HE 177 

BVH11 SP64 120 DAAHADNVRTKEE I NRQKQEHSQHREGGTS ANDGAVAFARSQGRYTTDDG Y I FNASD HE 179 



BVHll-2 SP64 178 DTGDAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPVQPRLSEN 237 

BVHll-2 JNR7/87 179 DTGDAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPAQPRLSEN 238 

BVHll-2 P4241 179 DTGDAYIVPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 

BVHll-2 A66 179 DTGDAYIVPHGNHFHYI PKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 

BVHll-2 WU2 179 DTGDAYIVPRGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 

BVHll-2 Rxl 178 DTGDAYIVPHGDHYHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 237 

BVH11 P4241 179 DTGDAYIVPHGNHFHYI PKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLS EN 238 

BVH11 WU2 179 DTGDAYIVPHGNHFHYI PKSDLS AS BLAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 238 

BVH11 A66 179 DTGDAYIVPHGNHFHYI PKSDLS AS ELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLS EN 238 

BVH11 Rxl 178 DTGDAYIVPHGDHYHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 237 

BVH11 JNR7/87 178 DTGDAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQGSRPSSSSSYNANPAQPRLSEN 237 

BVH11 SP63 178 DTGDAYIVPHGNHFHYIPKSDLSASELAAAQAYWNGKQGSRPSSSSSHNANPAQPRLSEN 237 

BVH11 SP64 180 DTGDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGRENLSNLRTYRRQNSDNTPRTNWV 239 



10/33 



WO 00/39299 



PCT/CA99/01218 



BVHll-2 SP64 238 KNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 285 

BVH11-2 JNR7/87 239 KNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 266 

BVHX1-2 P4241 239 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 266 

BVHll-2 A66 239 HNLTVTPTYHQN QGENI SSLLRELYAKPLSERHVESDGLI FDPAQITS 286 

BVHH-2 WU2 239 HNLTVTPTYHQN QGENISSLLRELYAKPLSERRVESDGLI FDPAQITS 286 

BVHll-2 Rxl 238 KNLTVTPTYHQN QGENISSLLRELYAKPLSERHVESDGLI FDPAQITS 285 

BVH11 P4241 239 HNLTVTPTYHQN - QGENISSLLRELYAKPLSERHVESDGLI FDPAQITS 286 

BVH11 WU2 239 HNLTVTPTYHQN QGENISSLLRELYAKPLSERHVESDGLI FDPAQITS 286 

BVH11 A66 239 HNLTVTPTYHQN QGENISSLLRELYAKPLSERHVESDGLI FDPAQITS 286 

BVH11 Rxl 238 HNLTVTPTYHQN QGENISSLLRELYAKPLSERHVESDGLI FDPAQITS 285 

BVH11 JNR7/87 238 KNLTVTPTYHQN - QGENISSLLRELYAKPLSERHVESDGLI FDPAQITS 285 

BVH11 SP63 238 KNLTVTPTYHQN QGENISSLLRELYAKPLSERHVESDGLI FDPAQITS 265 

BVH11 SP64 240 PSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYICLPLSQRHVESDGL I FDPAQITS 299 



BVH11 
BVH11- 
BVH11- 
BVH11- 
BVH11- 
BVH11- 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 



2 SP64 

2 JNR7/87 

2 P4241 

2 A66 

2 WU2 

2 Rxl 

P4241 

WU2 

A66 

Rxl 

JNR7/87 
SP63 
SP64 



286 RTARGVAVPHGNHYHFI PYEQKSELEKRIARI I PLRYRSNHWVPDSRPEQPS PQSTPEPS 345 

287 RTARGVAVPHGNHYHFI PYEQMSELEKRIARI I PLRYRSNHWVPDSRPEQPS PQSTPEPS 346 

287 RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPS PQ PS 342 

287 RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPS PQ PS 342 

287 RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPS PQ PS 342 

286 RTANGVAVPHGDHYHF I PYSQLSPLEEKLAR I I PLRYRSNHWVPDSRPEQPS PQSTPEPS 345 

287 RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ PS 342 

287 RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ PS 34 2 

287 RTARGVAVPHGNHYHFI PYEQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQ PS 342 

286 RTANGVAVPHGDHYHF I PYSQLSPLEEKLARI I PLRYRSNHWVPDSRPEQPSPQSTPEPS 345 
286 RTARGVAVPHGNHYHFI PYEQMSELEKRIARIIPLRYRSNHWVPDSRPEEPSPQPTPEPS 345 
286 RTARGVAVPHGNHYHFI PYSQMSELEERIARI I PLRYRSNHWVPDSRPEQPSPQSTPEPS 34 5 
300 RTARGVAVPHGNHYHFI PYEQMSELEKRIARI I PLRYRSNHWVPDSRPEEPSPQPTPE PS 359 



BVH11 


-2 SP64 


346 


BVH11 


-2 JNR7/87 


347 


BVH11 


-2 P4241 


343 


BVH11 


-2 A66 


343 


BVH11 


-2 WU2 


343 


BVH11 


-2 Rxl 


346 


BVH11 


P4241 


343 


BVH11 


WU2 


343 


BVH11 


A66 


343 


BVH11 


Rxl 


346 


BVH11 


JNR7/87 


346 


BVH11 


SP63 


346 


BVH11 


SP64 


360 



BVHll-2 SP64 406 LAKQES LSHKLGAKKTDLPSSDRE FYNKAYDLLARI HQDLLDNKGRQVDFEVLDNLLERL 465 

BVHll-2 JNR7/87 407 L^QESI^HKI^AWCTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERL 466 

BVHll-2 P4241 403 LAKQESLSHKICTICKTDLPSSDREFYNKAYDLIJ^ 462 

BVHll-2 A66 403 LAKQES L5 HKLGTKKTDLPS SDRE FYNKAYDLLAR I HQDLLDNKGRQVDFEALDNLLERL 462 

BVHll-2 WU2 403 LAKQESLSHKIiGTKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEA 462 

BVHll-2 Rxl 406 LAKQES LSHKI/3AlCiCTDLPSSDREFYN1^YDLLAR I HQDLIJ3NKGRQVDFE 465 

BVH11 P4241 403 lAKQESLSHKLGTKKTDLPSSDREPYNKAYDLLARIHQDLI^tlKGRQV^ 462 

BVH11 WU2 403 LAKQES LSHKI^TKKTDLPSSDREFYNKAYDLLAR I HQDLLDNKGR^ 462 

BVH11 A66 403 1AKQESL^HKI/3TKKTDLPSSDREFYNKAYDLLARIHQDLU3NKGRQVDFEALDNLLERL 462 

BVH11 Rxl 406 IAKQESI^HKI^AKJOTILPSSDREFYNKAYDLIARIHQDLLDNKGRQVDFEALD 465 

BVH11 JNR7/87 400 LAKQES LSHKLGAKKTDLPS SDRE FYNKAYDLLAR I HQDLLDNKGRQVDFEALDNLLERL 459 

BVH11 SP63 4 06 LAKQESLSHKIX3AKJCTDLPSSDREFYNKAYDLLAR I HQDLLDNKGRQVDFEALDNLLERL 46S 

BVH11 SP64 420 LAKQES LSHKLGAKKTDLPS SDRE FYNKA YD LLAR I HQDLLDNKGRQVDFEALDNLLERL 4 79 
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BVH11-2 SP64 466 KDVSSDKVKLVDDILAFIAPIRHPERLGKPNAQITTTO S25 

BVH11-2 JNR7/87 467 ICDVPSDKViaVDDIIAFIAPIRHPERLGKPNAQITYTO^^ S26 

BVH11-2 P4241 463 KDVSSDKVKLVEDILAFIAPIRHPERX^CKPNSQITYTDDEIOVAKIAGKYTTEDGY^ 522 

BVH11-2 A66 463 KDVSSDKVKLVEDIIJVFLAPIRHPER1/3KPNSQITYTDDEIQVAK1AGKYTTEDGYIFDP 522 

BVH11-2 WU2 463 KDVSSDKVKLVEDIUtfOAPIRHPERIXSKPNSQITYTDDEIQVAKI^ 522 

BVH11-2 Rxl 466 KDVSSDKVKLVDDILAFIAPIRHPERLGKPHAQITYTDDEIQVAJCL^ 525 

BVH11 P4241 463 KDVSSDKVKLVEDILAFLAPIRHPERLGKPNSQITYTDDEIOVAKIJ^KYTTEIW 522 

BVH11 WU2 463 KDVSSDKVKLVEDIIJVFIjAPIIWPERIXjKPNSQITYTDDEIOVAKIJVGKYTTEDGYIFDP 522 

BVH11 A66 463 KDVSSDKVKLVTTCILAFIAPIRHPERLGKPNSQITYTDDEIQVA^ 522 

BVH11 Rxl 466 KDVSSDKVKLVDDI LAFIAP IRWPERLGKPNAQI TYTDDE I QVAKLAGKYTTEDG Y IFDP 525 

BVH11 JNR7/87 460 KDVSSDKVKLVDDIIAFIAPIRilPERLGKPNAQITYTDDEIQVAKlJUSKYTTEDGYIFDP 519 

BVH11 SP63 466 EDVPSDKVKLVDDIUVFIAPIRHPERLGKPKAQITYTDDEIQVAKLAGKYTTEDGYIFOP 525 

BVH11 SP64 480 KDVSSDKVKLVDDILAFLAPIRHPERLGKPNAQITYTDDEIQVAKLA^ 539 



BVH11-2 SP64 526 RDITSDEGDAYVTPHKTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 585 

BVH11-2 JNR7/87 527 R^ITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAOAYAKEKGLTPPSTDHQDSGNTEAK 586 

BVH11-2 P4241 523 RDI TSDEGDA YVT PHKTHSHW I KKDSLS EAERAAAQAYAKEKGLTP PSTDHRDSGNTEAK 582 

BVH11-2 A66 523 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHODSGNTEAK 582 

BVH11-2 WU2 523 RDITSDEGDAYVTPHMTHSHWI KKDSLS EAERAAAQAYAJCEKGLTPPSTDHQDSGNTEAK 582 

BVH11-2 Rxl 526 RD ITSDEGDA YVT PHKTHSHW I KKDS LS EAERAAAQAYAKEKGLT P PSTDHQDSGNTEAK 585 

BVH11 P4241 523 ROITSDEGDAYVTPHMTOSHWIKKDSI^EAERAAAQAYAKEKGLTPPSTOHQDSGMTEAK 582 

BVH11 WU2 S23 RD I TSDEGDAYVTPHfn*HSHW I KKDS I^EAERAAAQAYAKEKGLTP PSTDHQDSGNTEAK 582 

BVH11 A66 S23 RDITSDEGDAYVTPHMTHSHWI KKDSLS EAERAAAQAYAKEKGLTP PSTDHQDSGNTEAK 582 

BVH11 Rxl 526 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 585 

BVH11 JNR7/87 520 RDITSDEGDAYVTPHMTHSHW I KKDS LSEAERAAAQAYAKEKGLTP PSTDHQDSGNTEAK 579 

BVH11 SP63 526 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 585 

BVH11 SP64 540 RDITSDEGDAYVTPHMTHSHWIKKDSLSEAERAAAQAYAKEKGLTPPSTDHQDSGNTEAK 599 



BVH11-2 SP64 566 GAEAI YNRVKAAKKVPLDRMP YNLQYTVEVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 645 

BVH11-2 JNR7/87 587 GAEAI YNRVKAAKKVPLDRM P YNLQYTVEVKNGSLI I PHYDHYHNIKFEWFDEGLYEAPK 646 

BVH11-2 P4241 583 GAEAI YNR VKAAKKV PLD RMPYNLQYTVEVKNGS LI I PHYDHYHNI KFEWFDEG LYEAPK 642 

BVH11-2 A66 583 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 642 

BVH11-2 WU2 583 GAEAI YNRVKAAKKVPLDRMP YNLQYTVEVKNGSLI I PHYDHYHNIKFEWFDEGLYEAPK 642 

BVH11-2 Rxl 586 GAEAI YNRVKAAKKVPLDRMP YNLQYTVEVKNGSLI I 64 5 

BVH11 P4241 583 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI I PHYDHYHNIKFEWFDEGLYEAPK 642 

BVH11 WU2 583 GAEAI YNRVKAAKKVPU)RMPYNLQYTVEVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 642 

BVH11 A66 583 GAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYD 642 

BVH11 Rxl 586 GAEAIYNRVKAAKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEWFDE<3LY 645 

BVH11 JNR7/87 580 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI I PHYDHYHNIKFEWFDEGLYEAPK 639 

BVH11 SP63 586 GAEAI YNRVKAAKKVPLDRMPYNLQYTVEVKNGSLI I PHYDHYHNIKFEWFDEGLYEAPK 64 5 

BVH11 SP64 600 GAEAI YNRVKAAKKVPLDRMP YNLQYTVEVKNGSLI I PHYDHYHNI KFEWFDEGLYEAPK 659 



BVH11-2 SP64 64 6 GYSLEDLIATVKYYVEHPNERPHSDNGFGNASDHVRXNK ADQDSK 690 

BVH11-2 JNR7/87 64 7 GYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNK VDQDSK 691 

BVH11-2 P4 241 64 3 GYTLEDLLATVKYYVEHPNERPHSDNGFGNASDHVRKNK ADQDSK 667 

BVH11-2 A66 64 3 G YTLEDLLATVKYYVEH PNER PHSDNG FGN ASDHVRKNK -- ADQDSK 687 

BVH11-2 WU2 64 3 GYTLEDLLATVKYYVEHPNER PHSDNGFGNASDHVRKNK ADQDSK 687 

BVH11-2 Rxl 64 6 GYSLEDLLATVKYYVEHPNERPHSDNGFGNASDHVQRNKNGQADTNQTEKPNEEKPQTEK 70S 

BVH11 P4241 643 G YTLEDLLATVKYYVEH PNERPHSDNGFGNASDHVRKNK -ADQDSK 687 

BVH11 WU2 643 GYTLEDLLATVKYYVEHPNER PHSDNGFGNASDHVRKNK ADQDSK 687 

BVH11 A66 643 GYTLEDLLATVKYYVEH PNERPHSDNGFGNASDHVRKNK ADQDSK 687 

BVH11 Rxl 646 GY S LED LLATVKYYVEH P NE R PH S DNG FGN AS DHVQRNK NGQ 687 

BVH11 JNR7/87 64 0 G YS LEDLLATVKYYVEH PNERPHSDNG FGNASDHVQRNK NGQ 681 

BVH11 SP63 646 GYTLEDLLATVKYYVEH PNER PHSDNG FGNASDHVQRNK NGQ 687 

BVH11 SP64 660 GYTLEDLLATVKYYVEH PNER PHSDNG FGNASDHVQRNK NGQ 701 
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BVH11- 
BVH11- 
BVHU- 
BVH11- 
BVH11- 
BVH11- 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 
BVH11 



2 SP64 

2 JNR7/87 

2 P4241 

2 A66 

2 WU2 

2 Rxl 

P4241 

WU2 

A66 

Rxl 

JNR7/87 
SP63 
SP64 



691 PDEDKEHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEA£DTTDEA£IPQV 7S0 

692 PDEDKEKDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEAEI PQV 751 
6B8 PDEOKGHDEVSEPTHPESDBKENHAGI^PSADNLYKPSTDTEETEEEAEDTTDEAEIPOV 74 7 
668 PDEDKGHDEVSEPTHPESDEKENHAGLKPSADNLYKPSTDTEETEEEAEDTTDEAEI PQV 74 7 
688 PDBDKGHDEVS E PTH PESDE KENHAG PS ADNLYKPSTDTEBTEEEAEDTTDEAE I PQV 747 
706 PEEDKEHDEVSEPTHPESDEKENHVGUIPSADNLYKPSTDTEETEEEAEiyrTDEAEIPOV 765 
688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTEETEEEAEDTTDEA£IPQV 74 7 
688 POEOKGKOEVS E PTH P ESDE KENKAG Hi PSADNLYKP STDTBETEEEAEDTTDBAE I PQV 74 7 
688 PDEDKGHDEVSEPTHPESDEKENHAGLNPSADNLYKPSTDTBETEEEAEDTTOEAEIPQV 747 
688 ADTNQTEKPNEEKPQTEKPEEETPREEKPQSEKPESPKPTEBPEEESPEESPEESEEPQV 74 7 
682 ADTNQTEKFNEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEEES PEES P EES EEPQV 741 

688 ADTKQTEKPSEEKPQTEKPEEETPREEKPQSEKPESP KPTEEPEEESPEESEEPQV 743 

702 ADTNQTEKPSEEKPQTEKPEEETPRBEKPQSEKPESP KPTEEPEEESPEESEEPQV 757 



BVH11- 

BVH11- 

BVH11- 

BVH11- 

8VH11- 

BVH11 

BVH11 

BVH11 

BVH11 

BVH11 

BVH11 

BVH11 

BVH11 



2 SP64 

2 JNR7/87 

2 P4241 

2 A66 

2 WU2 

2 Rxl 

P4241 

WU2 

A66 

Rxl 

JNR7/87 
SP63 
SP64 



751 EKSVX NAKI ADAEALLEKVTDPS I RQNAMETLTGLKSSLLLGTKDNNTI S AEVDSLIALL 810 

752 ENSVINAKIADAEAI^KVTOPSIRQNAMETLTGLKSSI^^ 811 
748 EHSVINAKIADAEALLEKVTDPSIRQNAMCTLTGLKSS 8 07 
748 EHSVINAJQADAEALLEKVTDPSIRQNAMETLTCLKSSLLLGT^ 807 
748 EHSVINAXIADAEALLEKVTDPSIRQNAMETLTGLKSSLLLGTKDNin'ISAEV^ 807 
766 EYSVINAXIAEAEAlXEKVTDSSIRQNAVETLTGI^SLLLGTim 825 
748 EHSV I NAKI ADAEAIiLE KVTD P S I RQN AMETLTG LKS S LLLGTKDNNT I SAEVDS LLALL 807 
748 EHSVIHAKIADAEAI*IJ2KVTOPSIRQNAMETLTGLKSSI*IXGT^ 807 
748 EHSVINAXIADAEALLEKVTOPSIRQNAMETLTGIJ^ B 07 
748 ETEKVKEKIAEAEDLI£KIQNPIIKSNAKETLTCIJ(NNLLFO^ 807 
742 ETEKVKEICLREAEDLIXJKIQNPI IKSNAKETLTGIJCNNLL 8 01 
744 ETE KVE EKLREAEDLLG KI QD P 1 1 KSN AKETLTG LKNNLLFGTQDNNT I MAEAE KLLALL 803 
758 ETEKVEEKLREAEDLIXjKXQDP 1 1 KSNAKETLTGLKNNLLFGTQDNNTI MAEAE KLIALL 817 
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811 KESQPAPIQ 819 

812 KESQPAPIQ 820 
808 KKSQPAPIQ 816 
808 KKSQPAPIQ 816 
808 KKSQPAPIQ 816 
826 KESQPAPIQ 834 



808 KESK 
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808 KESK 

802 KESK 

804 KESK 

616 KESK 
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AATTCCTTGT CGGGTAAGTT CCGACCCGCA CGAAAGGCGT AATGATTTGG GCACTGTCTC 60 

AACGAGAGAC TCGGTGAAAT TTTAGTACCT GTGAAGATGC AGGTTACCCG CGACAGGACG 120 

GAAAGACCCC ATGGAGCTTT ACTGCAGTTT GATATTGAGT GTCTGTACCA CATGTACAGG 180 

ATAGGTAGGA GTCTAAGAGA TCGGGACGCC AGTTTCGAAG GAGACGCTGT TGGGATACTA 24 0 

CCCTTGTGTT ATGGCCACTC TAACCCAGAT AGGTGATCCC TATCGGAGAC AGTGTCTGAC 300 

GGGCAGTTTG ACTGGGGCGG TCGCCTCCTA AAAGGTAACG GAGGCGCCCA AAGGTTCCCT 360 

CAGAATGGTT GGAAATCATT CGCAGAGTGT AAAGGTATAA GGGAGCTTGA CTGCGAGAGC 420 

TACAACTCGA GCAGGGACGA AAGTCGGGCT TAGTGATCCG GTGGTTCCGT ATGGAAGGGC 480 

CATCGCTCAA CGGATAAAAG CTACCCTGGG GATAACAGGC TTATCTCCCC CAAGAGTTCA 540 

CATCGACGGG GAGGTTTGGC ACCTCGATGT CGGCTCGTCG CATCCTGGGG CTGTAGTCGG 600 

TCCCAAGGGT TGGGCTGTTC GCCCATTAAA GCGGCACGCG AGCTGGGTTC AGAACGTCGT 660 

GAGACAGTTC GGTCCCTATC CGTCGCGGGC GTAGGAAATT TGAGAGGATC TGCTCCTAGT 720 

ACGAGAGGAC CAGAGTGGAC TTACCGCTGG TGTACCAGTT GTCTTGCCAA AGGCATCGCT 780 

GGGTAGCTAT GTAGGGAAGG GATAAACGCT GAAAGCATCT AAGTGTGAAA CCCACCTCAA 840 

GATGAGATTT CCCATGATTA TATATCAGTA AGAGCCCTGA GAGATGATCA GGTAGATAGG 900 

TTAGAAGTGG AAGTGTGGCG ACACATGTAG CGGACTAATA CTAATAGCTC GAGGACTTAT 960 

CCAAAGTAAC TGAGAATATG AAAGCGAACG GTTTTCTTAA ATTGAATAGA TATTCAATTT 1020 

TGAGTAGGTA TTACTCAGAG TTAAGTGACG ATAGCCTAGG AGATACACCT- GTACCCATGC 1080 

CGAACACAGA AGTTAAGCCC TAGAACGCCG GAAGTAGTTG GGGGTTGCCC CCTGTGAGAT 1140 

AGGGAAGTCG CTTAG CTCTA GGGAGTTTAG CTCAGCTGGG AGAGCATCTG CCTTACAAGC 1200 

AG AG GGTCAG CGGTTCGATC CCGTTAACTC CCAAAGGTCC CGTAGTGTAG CGGTTATCAC 1260 

GTCGCCCTGT CACGGCGAAG ATCGCGGGTT CGATTCCCGT CGGGACCGTT TAAGGTAACG 132 0 

CAAGTTATTT TAGACTCGTT AGCTCAGTTG GTAGAGCAAT TGACTTTTAA TCAATGGGTC 1380 

ACTGGTTCGA GCCCAGTACG GGTCATATAT GCGGGTTTGG CGGAATTCTA ATCTCTTTGA 1440 

AATCATCTTC TCTCACTTTC CAAAACTCTA TTACCTCTTA TTATACCACA TTTCAATCTT 1500 

CAACTTCCCA GTAATATAAG CACCTCTGGC GAAAGAAGTT TCAATGTCCT AAAGTAATAA 1560 

GTGAATCCAA TTCAGGAACT CCAAGAACAA AAGAAACATC TGGTGTCACA AGTATTGGAT 1620 

GGCACAGAGT CACGTGGTAG TCTGACCCTA GCAGAAATTT TAAATAGTAA ACTATTTACT 1680 

GGTTAATTAA ATGGTTAAAT AACCGGTTTA GAAAACTATT TAATAAAGTA AAAGAAGTTG 1740 

AGAAAAAACT TCATCATTTA TTGAAATGAG GGATTTATGA AATTTAGTAA AAAATATATA 1800 

GCAGCTGGAT CAGCTGTTAT CGTATCCTTG AGTCTATGTG CCTATGCACT AAACCAGCAT I860 

CGTTCGCAGG AAAATAAGGA CAATAATCGT GTCTCTTATG TGGATGGCAG CCAGTCAAGT 1920 

CAGAAAAGTG AAAACTTGAC ACCAGACCAG GTTAGCCAGA AAGAAGGAAT TCAGGCTGAG 1980 

CAAATTGTAA TCAAAATTAC AGATCAGGGC TATGTAACGT CACACGGTGA CCACTATCAT 2040 

TACTATAATG GGAAAGTTCC TTATGATGCC CTCTTTAGTG AAGAACTCTT GATGAAGGAT 2100 

CCAAACTATC AACTTAAAGA CGCTGATATT GTCAATGAAG TCAAGGGTGG TTATATCATC 2160 

AAGGTCGATG GAAAATATTA TGTCTACCTG AAAGATGCAG CTCATGCTGA TAATGTTCGA 2220 

ACTAAAGATG AAATCAATCG TCAAAAACAA GAACATGTCA AAGATAATGA GAAGGTTAAC 2280 

TCTAATGTTG CTGTAGCAAG GTCTCAGGGA CGATATACGA CAAATGATGG TTATGTCTTT 234 0 

AATCCAGCTG ATATTATCGA AGATACGGGT AATGCTTATA TCGTTCCTCA TGGAGGTCAC 2400 

TATCACTACA TTCCCAAAAG CGATTTATCT GCTAGTGAAT TAGCAGCAGC TAAAGCACAT 2460 

CTGGCTGGAA AAAATATGCA ACCGAGTCAG TTAAGCTATT CTTCAACAGC TAGTOACAAT 2520 

AACACGCAAT CTGTAGCAAA AGGATCAACT AGCAAGCCAG CAAATAAATC TGAAAATCTC 2580 

CAGAGTCTTT TGAAGGAACT CTATGATTCA CCTAGCGCCC AACGTTACAG TGAATCAGAT 264 0 

GGCCTGGTCT TTGACCCTGC TAAGATTATC AGTCGTACAC CAAATGGAGT TGCGATTCCG 2700 

CATGGCGACC ATTACCACTT TATTCCTTAC AGCAAGCTTT CTGCTTTAGA AGAAAAGATT 2760 

GCCAGAATGG TGCCTATCAG TGGAACTGGT TCTACAGTTT CTACAAATGC AAAACCTAAT 2820 

GAAGTAGTGT CTAGTCTAGG CAGTCTTTCA AGCAATCCTT CTTCTTTAAC GACAAGTAAG 2880 

GAGCTCTCTT CAGCATCTGA TGGTTATATT TTTAATCCAA AAGATATCGT TGAAGAAACG 2940 

GCTACAGQTT ATATTGTAAG ACATGGTGAT CATTTCCATT ACATTCCAAA ATCAAATCAA 3000 

ATTGGGCAAC CGACTCTTCC AAACAATAGT CTAGCAACAC CTTCTCCATC TCTTCCAATC 3060 

AATCCAGGAA CTTCACATGA GAAACATGAA GAAGATGGAT ACGGATTTGA TGCTAATCGT 3120 

ATTATCGCTG AAGATGAATC AGGTTTTGTC ATGAGTCACG GAGACCACAA TCATTATTTC 3180 

TTCAAGAAGG ACTTGACAGA AGAGCAAATT AAGGCTGCGC AAAAACATTT AGAGGAAGTT 324 0 

AAAACTAGTC ATAATGGATT AGATTCTTTG TCATCTCATG AACAGGATTA TCCAGGTAAT 3300 

GCCAAAGAAA TGAAAGATTT AGATAAAAAA ATCGAAGAAA AAATTGCTGG CATTATGAAA 3360 
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CAATATGGTG TCAAACGTGA AAGTATTGTC GTGAATAAAG AAAAAAATGC GATTATTTAT 
CCGCATGGAG ATCACCATCA TGCAGATCCG ATTGATGAAC ATAAACCGGT TGGAATTGGT 
CATTCTCACA GTAACTATGA ACTGTTTAAA CCCGAAGAAG GAGTTGCTAA AAAAGAAGGG 
AATAAAGTTT ATACTGGAGA AGAATTAACG AATGTTGTTA ATTTGTTAAA AAATAGTACG 
TTTAATAATC AAAACTTTAC TCTAGCCAAT GGTCAAAAAC GCGTTTCTTT TAGTTTTCCG 
CCTGAATTGG AGAAAAAATT AGGTATCAAT ATGCTAGTAA AATTAATAAC ACCAGATGGA 
AAAGTATTGG AGAAAGTATC TGGTAAAGTA TTTGGAGAAG GAGTAGGGAA TATTGCAAAC 
TTTGAATTAG ATCAACCTTA TTTACCAGGA CAAACATTTA AGTATACTAT CGCTTCAAAA 
GATTATCCAG AAGTAAGTTA TGATGGTACA TTTACAGTTC CAACCTCTTT AGCTTACAAA 
ATGGCCAGTC AAACGATTTT CTATCCTTTC CATGCAGGGG ATACTTATTT AAGAGTGAAC 
CCTCAATTTG CAGTGCCTAA AGGAACTGAT GCTTTAGTCA GAGTGTTTGA TGAATTTCAT 
GGAAATGCTT ATTTAGAAAA TAACTATAAA GTTGGTGAAA TCAAATTACC GATTCCGAAA 
TTAAACCAAG GAACAACCAG AACGGCCGGA AATAAAATTC CTGTAACCTT CATGGCAAAT 
GCTTATTTGG ACAATCAATC GACTTATATT GTGGAAGTAC CTATCTTGGA AAAAGAAAAT 
CAAACTGATA AACCAAGTAT TCTACCACAA TTTAAAAGGA ATAAAGCACA AGAAAACTCA 
AAACTTGATG AAAAGGTAGA AGAACCAAAG ACTAGTGAGA AGGTAGAAAA AGAAAAACTT 
TCTGAAACTG GGAATAGTAC TAGTAATTCA ACGTTAGAAG AAGTTCCTAC AGTGGATCCT 
GTACAAGAAA AAGTAGCAAA ATTTGCTGAA AGTTATGGGA TGAAGCTAGA AAATGTCTTG 
TTTAATATGG ACGGAACAAT TGAATTATAT TTACCATCAG GAGAAGTCAT TAAAAAGAAT 
ATGGCAGATT TTACAGGAGA AGCACCTCAA GGAAATGGTG AAAATAAACC ATCTGAAAAT 
GGAAAAGTAT CTACTGGAAC AGTTGAGAAC CAACCAACAG AAAATAAACC AGCAGATTCT 
^^ GG CACCAAACGA AAAACCTGTA AAACCAGAAA ACTCAACGGA TAATGGAATG 
TTGAATCCAG AAGGGAATGT GGGGAGTGAC CCTATGTTAG ATCCAGCATT AGAGGAAGCT 
CCAGCAGTAG ATCCTGTACA AGAAAAATTA GAAAAATTTA CAGCTAGTTA CGGATTAGGC 
TTAGATAGTG TTATATTCAA TATGGATGGA ACGATTGAAT TAAGATTGCC AAGTGGAGAA 
^I^™**** AGAATTTATC TGATTTCATA GCGTAAGGAA TAGCAGTAGA AAAAGTCTGA 
ATCAAAAATG AAGTTCTCTC AAAAGTTAGA AATAAAACTC TGACTTTGGG AGAATTTCAT 
TTTATTATTA ATATATAAAA TTTCTTGACA TACAACTTAA AAAGAGGTGG AATATTTACT 
AGTTAATT (SEQ ID NO : 11) 
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CAGAGATCTT AQTGAATCAA ATATACTTAA GAAAAGAGGA AAGAATGAAA ATCAATAAAA 60 

AATATCTAGC TGGGTCAGTA GCTACACTTG TTTTAAGTGT CTGTGCTTAT GAACTAGGTT 120 

TGCATCAAGC TCAAACTGTA AAAGAAAATA ATCGTGTTTC CTATATAGAT GGAAAACAAG 180 

CGACGCAAAA AACGGAGAAT TTGACTCCTG ATGAGGTTAG CAAGCGTGAA GGAATCAACG 240 

CCGAACAAAT CGTCATCAAG ATTACGGATC AAGGTTATGT GACCTCTCAT GGAGACCATT 300 

ATCATTACTA TAATGGCAAG GTCCCTTATG ATGCCATCAT CAGTGAAGAG CTCCTCATGA 360 

AAGATCCGAA TTATCAGTTG AAGGATTCAG ACATTGTCAA TGAAATCAAG GGTGGTTATG 420 

TCATTAAGGT AAACGGTAAA TACTATGTTT ACCTTAAGGA TGCAGCTCAT GCGGATAATG 480 

TCCGTACAAA AGAAGAAATC AATCGGCAAA AACAAGAACA TAGTCAGCAT CGTGAAGGAG 540 

GGACTTCAGC AAACGATGGT GCGGTAGCCT TTGCACGTTC ACAGGGACGC TACACCACAG 600 

ATGATGGTTA TATCTTCAAT GCATCTGATA TCATCGAAGA TACGGGCGAT GCCTATATCG 660 

TTCCTCATGG AGATCATTAC CATTACATTC CTAAGAATGA GTTATCAGCT AGCGAOTTGG 720 

CTGCTGCAGA AGCCTTCCTA TCTGGTCGGG AAAATCTGTC AAATTTAAGA ACCTATCGCC 780 

GACAAAATAG CGATAACACT CCAAGAACAA ACTGGGTACC TTCTGTAAGC AATCCAGGAA 840 

CTACAAATAC TAACACAAGC AACAACAGCA ACACTAACAG TCAAGCAAGT CAAAGTAATG 900 

ACATTGATAG T CTCTT GAAA CAGCTCTACA AACTGCCTTT GAGTCAACGC CATGTAGAAT 960 
CTGATGGCCT TATTTTCGAC CCAGCGCAAA TCACAAGTCG AACCGCCAGA GGTGTAGCTG - 1020 

TCCCTCATGG TAACCATTAC CACTTTATCC CTTATGAACA AATGTCTGAA TTGGAAAAAC 1080 

GAATTGCTCG TATTATTCCC CTTCGTTATC GTTCAAACCA TTGGGTACCA GATTCAAGAC 1140 

CAGAAGAACC AAOTCCACAA CCGACTCCAG AACCTAGTCC AAGTCCGCAA CCTGCACCAA 1200 

ATCCTCAACC AGCTCCAAGC AATCCAATTG ATGAGAAATT GGTCAAAGAA GCTGTTCGAA 1260 

AA ^I AGGCGA TGGTTATGTC TTTGAGGAGA ATGGAGTTTC TCGTTATATC CCAGCCAAGA 1320 

ATCTTTCAGC AGAAACAGCA GCAGGCATTG ATAGCAAACT GGCCAAGCAG GAAAGTTTAT 1380 

CTCATAAGCT AGGAGCTAAG AAAACTGACC TCCCATCTAG TGATCGAGAA TTTTACAATA 1440 

AGGCTTATGA CTTACTAGCA AGAATTCACC AAGATTTACT TGATAATAAA GGTCGACAAG 1S00 

TTGATTTTGA GGCTTTGGAT AACCTGTTGG AACGACTCAA GGATGTCTCA AGTGATAAAG 1S60 

TCAAGTTAGT GGATGATATT CTTGCCTTCT TAGCTCCGAT TCGTCATCCA GAACGTTTAG 1620 

GAAAACCAAA TGCGCAAATT ACCTACACTG ATGATGAGAT TCAAGTAGCC AAGTTGGCAG 1680 

GCAAGTACAC AACAGAAGAC GGTTATATCT TTGATCCTCG TGATATAACC AGTGATGAGG 1740 

GGGATGCCTA TGTAACTCCA CATATGACCC ATAGCCACTG GATTAAAAAA GATAGTTTGT 1800 

CTGAAGCTGA GAGAGCGGCA GCCCAGGCTT ATGCTAAAGA GAAAGGTTTG ACCCCTCCTT 1860 

CGACAGACCA TCAGGATTCA GGAAATACTG AGGCAAAAGG AGCAGAAGCT ATCTACAACC 1920 

GCGTGAAAGC AGCTAAGAAG GTGCCACTTG ATCGTATGCC TTACAATCTT CAATATACTG 1980 

TAGAAGTCAA AAACGGTAGT TTAATCATAC CTCATTATGA CCATTACCAT AACATCAAAT 2040 

TTGAGTGGTT TGACGAAGGC CTTTATGAGG CACCTAAGGG GTATACTCTT GAGGATCTTT 2100 

TGGCGACTGT CAAGTACTAT GTCGAACATC CAAACGAACG TCCGCATTCA GATAATGGTT 2160 

TTGGTAACGC TAGCGACCAT GTTCAAAGAA ACAAAAATGG TCAAGCTGAT ACCAATCAAA 2220 

CGGAAAAACC AAGCGAGGAG AAACCTCAGA CAGAAAAACC TGAGGAAGAA ACCCCTCGAG 2280 

AAGAGAAACC ACAAAGCGAG AAACCAGAGT CTCCAAAACC AACAGAGGAA CCAGAAGAAG 2340 

AATCACCAGA GGAATCAGAA GAACCTCAGG TCGAGACTGA AAAGGTTGAA GAAAAACTGA 2400 

GAGAGGCTGA AGATTTACTT GGAAAAATCC AGGATCCAAT TATCAAGTCC AATGCCAAAG 2460 

AGACTCTCAC AGGATTAAAA AATAATTTAC TATTTGGCAC CCAGGACAAC AATACTATTA 2520 

TGGCAGAAGC TGAAAAACTA TTGGCTTTAT TAAAGGAGAG TAAGTAAAGG TAGCAGCATT 2580 

TTCTAACTCC TAAAAACAGG ATAGGAGAAC GGGAAAACGA AAAATGAGAG CAGAATGTGA 2640 

GTTCTAG (SED ID NO : 12) 2647 
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GGGTCTTAAA 
AAATCTGGAA 
TCAATAAAAA 
AACTTGGTCG 
ATGGTGATCA 
AGGGGATCAA 
ATGGAGACCA 
AACTTCTCAT 
AGGGTGGCTA 
ATGCGGACAA 
ATCATAACTC 
CGGATGATGG 
TCGTTCCTCA 
TAGCTGCTGC 
GTTATAATGC 
CAACTTATCA 
AACCCTTATC 
CAAGTCGAAC 
ATGAACAAAT 
CAAACCATTG 
CTAGTCCAAG 
AGAAATTGGT 
GAGTTTCTCG 
GCAAACTGGC 
CATCTAGTGA 
ATTTACTTGA 
GACTCAAGGA 
CTCCGATTCG 
ATGAGATTCA 
ATCCTCGTGA 
GCCACTGGAT 
CTAAAGAGAA 
CAAAAGGAGC 
GTATGCCTTA 
ATTATGACCA 
CTAAGGGGTA 
ACGAACGTCC 
AGGCAGACCA 
CTCACCCTGA 
TTTATAAACC 
AGGCTGAAAT 
CCTTGCTAGA 
GTCTAAAAAG 
ATAGTCTCTT 
(SEQ ID NO 



ACTCTGAATC 
GAAAATATGA 
ATATCTAGCA 
TCACCAAGCT 
GGCTGGTCAA 
CGCCGAACAA 
TTATCATTAC 
GAAAGATCCG 
TGTGATTAAG 
TATTCGGACA 
AAGAGCAGAT 
GTATATCTTC 
CGGCGACCAT 
AGAAGCCTAT 
AAATCCAGTT 
TCAAAATCAA 
AGAACGCCAT 
CGCCAGAGGT 
GTCTGAATTG 
GGTACCAGAT 
TCTGCAACCT 
CAAAGAAGCT 
TTATATCCCA 
CAAGCAGGAA 
TCGAGAATTT 
TAATAAAGGT 
TGTCTCAAGT 
TCATCCAGAA 
AGTAGCCAAG 
TATAACCAGT 
TAAAAAAGAT 
AGGTTTGACC 
AGAAGCTATC 
CAATCTTCAA 
TTACCATAAC 
TAGTCTTGAG 
GCATTCAGAT 
AGATAGTAAA 
ATCTGATGAA 
AAGCACTGAT 
TCCTCAAGTA 
AAAAGTAACA 
TAGTCTTCTT 
GGCTTTGTTA 
: 13) 



CTTTAGAGGC 
GTGTTCTAGC 
GGTTCAGTGG 
GGTCAGGTTA 
AAGGCAGAAA 
ATTGTTATCA 
TATAATGGCA 
AATTATCAGT 
GTAGACGGAA 
AAAGAAGAGA 
AATGCTGTTG 
AATGCATCTG 
TACCATTACA 
TGGAATGGGA 
CAACCAAGAT 
GGGGAAAACA 
GTAGAATCTG 
GTAGCTGTCC 
GAAAAACGAA 
TCAAGACCAG 
GCACCAAATC 
GTTCGAAAAG 
GCCAAGGATC 
AGTTTATCTC 
TACAATAAGG 
CGACAAGTTG 
GATAAAGTCA 
CGTTTAGGAA 
TTGGCAGGCA 
GATGAGGGGG 
AGTTTGTCTG 
CCTCCTTCGA 
TACAACCGCG 
TATACTGTAG 
ATCAAATTTG 
GATCTTTTGG 
AATGGTTTTG 
CCTGATGAAG 
AAAGAGAATC 
ACGGAAGAGA 
GAGAATTCTG 
GATCCTAGTA 
CTCGGAACGA 
AAAGAAAGTC 



AGACCCACAA 
AGAAGAATTA 
CAGTCCTTGC 
AGAAAGAGTC 
ATTTGACACC 
AGATTACGGA 
AGGTTCCTTA 
TGAAGGATTC 
AATACTATGT 
TTAAACGTCA 
CTGCAGCCAG 
ATATCATTGA 
TTCCTAAGAA 
AGCAGGGATC 
TGTCAGAGAA 
TTTCAAGCCT 
ATGGCCTTAT 
CTCATGGTAA 
TTGCTCGTAT 
AACAACCAAG 
CTCAACCAGC 
TAGGCGATGG 
TTTCAGCAGA 
ATAAGCTAGG 
CTTATGACTT 
ATTTTGAGGT 
AGTTAGTGGA 
AACCAAATGC 
AGTACACAAC 
ATGCCTATGT 
AAGCTGAGAG 
CAGACCACCA 
TGAAAGCAGC 
AAGTCAAAAA 
AGTGGTTTGA 
CGACTGTCAA 
GTAACGCTAG 
ATAAGGAACA 
ACGCTGGTTT 
CAGAGGAAGA 
TTATTAACGC 
TTAGACAAAA 
AAGATAATAA 
AACCGGCTCC 



AATGACAAGA 
AAGTGAGGAA 
CCTAAGTGTT 
TAATCGAGTT 
AGATGAAGTC 
TCAAGGTTAT 
TGATGCCATC 
AGACATTGTC 
TTACCTTAAA 
GAAGCAGGAA 
AGCCCAAGGA 
GGACACGGGT 
TGAGTTATCA 
TCGTCCTTCT 
CCACAATCTG 
TTTACGTGAA 
TTTCGACCCA 
CCATTACCAC 
TATTCCCCTT 
TCCACAATCG 
TCCAAGCAAT 
TTATGTCTTT 
AACAGCAGCA 
AGCTAAGAAA 
ACTAGCAAGA 
TTTGGATAAC 
TGATATTCTT 
GCAAATTACC 
AGAAGACGGT 
AACTCCACAT 
AGCGGCAGCC 
GGATTCAGGA 
TAAGAAGGTG 
CGGTAGTTTA 
CGAAGGCCTT 
GTACTATGTC 
TGACCATGTT 
TGATGAAGTA 
AAATCCTTCA 
AGCTGAAGAT 
TAAGATAGCA 
TGCTATGGAG 
CACTATTTCA 
TATACAGTAG 



cctatttaga 
agaatgaaaa 
tgttcctatg 
tcttatatag 
agtaagagag 
gtgacctctc 
atcagtgaag 
aatgaaatca 
gatgcggccc 
cacagtcata 
cgttatacaa 
gatgcttata 
gctagcgagt 
tcaagttcta 
actgtcactc 
ttgtatgcta 

GCGCAAATCA 
TTTATCCCTT 
CGTTATCGTT 
ACTCCGGAAC 
CCAATTGATG 
GAGGAGAATG 
GGCATTGATA 
ACTGACCTCC 
ATTCACCAAG 
CTGTTGGAAC 
GCCTTCTTAG 
TACACTGATG 
TATATCTTTG 
ATGACCCATA 
CAGGCTTATG 
AATACTGAGG 
CCACTTGATC 
ATCATACCTC 
TATGAGGCAC 
GAACATCCAA 
CGTAAAAATA 
AGTGAGCCAA 
GCAGATAATC 
ACCACAGATG 
GATGCGGAGG 
ACATTGACTG 
GCAGAAGTAG 
TAAAATGAA 
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MKINKKYLAG SVAVLALSVC SYELGRHQAG 
AENLTPDEVS KREGINAEQI VIKITDQGYV 
SEELLMKDPN YQLKDSDIVN EIKGGYVIKV 
EEIKRQKQEH SHNHNSRADN AVAAARAQGR 
AYIVPHGDHY HYIPKNELSA SELAAAEAYW 
PRLSENHNLT VTPTYHQNQG ENISSLLREL 
QITSRTARGV AVPHGNHYHF IPYEQMSELE 
RPEQPSPQST PEPSPSLQPA PNPQPAPSNP 
EHGVSRYIPA KDLSAETAAG IDSKLAKQES 
NKAYDLLARI HQDLLDNKGR QVDFEVLDNL 
FLAPIRHPER LGKPNAQITY TDDEIQVAKL 
EGDAYVTPHM THSHWIKKDS LSEAERAAAQ 
TEAKGAEAIY NRVKAAKKVP LDRMPYNLQY 
KFEWFDEGLY EAPKGYSLED LLATVKYYVE 
KNKADQDSKP DEDKEHDEVS EPTHPESDEK 
EETEEEAEDT TDEAEIPQVE NSVINAKIAD 
LTGLKSSLLL GTKDNNTISA EVDSLLALLK 
(SEQ ID NO : 14) 



QVKKESNRVS 


YIOGOQAGQK 


50 


TSHGDHYHYY 


NGKVPYOAII 


100 


LAjfv I i VI LitSJJ 


AAHADN I RTK 


150 


I TTDDG Y I FN 


ASDIIEDTGO 


200 


NGKQGSRPSS 


SS SYNANPVQ 


250 


XAKP US ERHV 


ESOGLIFOPA 


300 


VDThDTTBT B 

tSJilAKX X PliR 


YRSNHWVPOS 


3S0 


TnPifT.vifpau 




400 


LSHKLGAKKT 


DLPSSDREFY 


450 


LERLKDVSSD 


KVKIiVDOIXiA 


500 


AGKYTTEDGY 


IF0PRDIT5D 


550 


AYAKEKGLTP 


PSTDHQDSGN 


600 


TVEVKNGSLI 


IPHYDHYHNI 


650 


HPNERPHSDN 


GFGNASDHVR 


700 


ENHAGLNPSA 


DNLYKPSTOT 


750 


AEALLEKVTD 


PSIRQNAMET 


600 


ESQPAPIQ 




83B 
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TGTGCCTATG CACTAAACCA GCATCGTTCG CAGGAAAATA AGGACAATAA TCGTGTCTCT 
TATGTGGATG GCAGCCAGTC AAGTCAGAAA AGTGAAAACT TGACACCAGA CCAGGTTAGC 
^2^^° GAATTCAOGC TGAGCAAATT GTAATCAAAA TTACAGATCA GGGCTATGTA 
ACGTCACACG GTGATCACTA TCATTACTAT AATGGGAAAG TTCCTTATGA TGCCCTCTTT 
^™ GAAC TCTTGATGAA GGATCCAAAC TATCAACTTA AAGACGCTGA TATTGTCAAT 
GAAGTCAAGG GTGGTTATAT CATCAAGGTC GATGGAAAAT ATTATGTCTA CCTGAAAGAT 
GCAGCTCATG CTGATAATGT TCGAACTAAA GATGAAATCA ATCGTCAAAA ACAAGAACAT 
GTCAAAGATA ATGAGAAGGT TAACTCTAAT GTTGCTGTAG CAAGGTCTCA GGGACGATAT 
^ A ^I G ATGGrrATGT CTTTAATCCA GCTGATATTA TCGAAGATAC GGGTAATGCT 
CTCATGGAGG TCACTATCAC TACATTCCCA AAAGCGATTT ATCTGCTAGT 
St^ff CAGCTAAAGC ACATCTGGCT GGAAAAAATA TGCAACCGAG TCAGTTAAGC 
CACCTTCTCC ATCTCTTCCA ATCAATCCAG GAACTTCACA TGAGAAACAT 
° GATACGGATT TGATGCTAAT CGTATTATCG CTGAAGATGA ATCAGGTTTT 
^^" C ACGGAGACCA CAATCATTAT TTCTTCAAGA AGGACTTGAC AGAAGAGCAA 
A ZI^ A ™ G CGCAAAAACA TTTAGAGGAA GTTAAAACTA GTCATAATGG ATTAGATTCT 
^™" C ATGAACAGGA TTATCCAAGT AATGCCAAAG AAATGAAAGA TTTAGATAAA 
AAAATCGAAG AAAAAATTGC TGGCATTATG AAACAATATG GTGTCAAACG TGAAAGTATT 
^ f ^ ATA AAGAAAAAAA TGCGATTATT TATCCGCATG GAGATCACCA TCATGCAGAT 
^n™ A T° AACATAAACC GGTTGGAATT GGTCATTCTC ACAGTAACTA TGAACTGTTT 
^ff™ 0 AAGGAGTTGC TAAAAAAGAA GGGAATAAAG TTTATACTGG AGAAGAATTA 
ACGAATGTTG TTAATTTGTT AAAAAATAGT ACGTTTAATA ATCAAAACTT TACTCTAGCC 
^^f^ AACGCGTTTC TTTTAGTTTT CCGCCTGAAT TGGAGAAAAA ATTAGGTATC 
TAAAATTAAT AACACCAGAT GGAAAAGTAT TGGAGAAAGT ATCTGGTAAA 
™^ AAGGAGTAGG GAATATTGCA AACTCTGAAT TAGATCAACC TTATTTACCA 
f^^- TTAAGTATAC TATCGCTTCA AAAGATTATC CAGAAGTAAG TTATGATGGT 
^™^ G TTCCAACCTC TTT^CTTAC AAAATGGCCA GTCAAACGAT TTTCTATCCT 
TTCCATGCAG GGGATACTTA TTTAAGAGTG AACCCTCAAT TTGCAGTGCC TAAAGGAACT 
GATGCTTTAG TCAGAGTGTT TGATGAATTT CATGGAAATG CTTATTTAGA AAATAACTAT 
AAAGTTGGTG AAATCAAATT ACCGATTCCG AAATTAAACC AAGGAACAAC CAGAACGGCC 
GGAAATAAAA TTCCTGTAAC CTTCATGGCA AATGCTTATT TGGACAATCA ATCGACTTAT 
ATTGTGGAAG TACCTATCTT GGAAAAAGAA AATCAAACTG ATAAACCAAG TATTCTACCA 
^^ AAAA GGAATAAAGC ACAAGAAAAC TCAAAACTTG ATCAAAAGGT AGAAGAACCA 
AAGACTAGTG AGAAGGTAGA AAAAGAAAAA CTTTCTGAAA CTGGGAATAG TACTAGTAAT 
TCAACGTTAG AAGAAGTTCC TACAGTGGAT CCTGTACAAG AAAAAGTAGC AAAATTTGCT 
GAAAGTTATG GGATGAAGCT AGAAAATGTC TTGTTTAATA TGGACGGAAC AATTGAATTA 
TATTTACCAT CGGGAGAAGT CATTAAAAAG AATATGGCAG ATTTTACAGG AGAAGCACCT 
CAAGGAAATG GTGAAAATAA ACCATCTGAA AATGGAAAAG TATCTACTGG AACAGTTGAG 
AACCAACCAA CAGAAAATAA ACCAGCAGAT TCTTTACCAG AGGCACCAAA CGAAAAACCT 
^I AA ^ CCAG AAAACTCAAC GGATAATGGA ATGTTGAATC CAGAAGGGAA TGTGGGGAGT 
GACCCTATGT TAGATTCAGC ATTAGAGGAA GCTCCAGCAG TAGATCCTGT ACAAGAAAAA 
TTAGAAAAAT TTACAGCTAG TTACGGATTA GGCTTAGATA GTGTTATATT CAATATGGAT 
GGAACGATTG AATTAAGATT GCCAAGTGGA GAAGTGATAA AAAAGAATTT ATTGATCTCA 
TAGCGTAA (SEQ ID NO : 15) 
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CAYALNQHRS QENKDNNRVS YVDGSQSSQK SENLTPDQVS QKEGIQAEQI 
VI KITDQG YV TSHGDHYHYY NGKVPYDALF SEELLMKDPN YOLKDADIVN 
vSe™ DQKYYVYLKD AAHADNVRTK DEINRQKQEH VKDMEKVNSN 
VAVARSQGRY TTNDOYVFNP ADIIEDTGNA YIVPHGGHYH YIPKSDLSAS 
EiAAAKAHLA GKNMQPSQLS YSSTPSPSLP INPGTSHEKH EEDGYOFDAN 
t e™nf P VMM0D »« Sf FFKKDLTEEQ IKAAQKHLEE VKTSHNGLDS 
LSSHEQDYPS NAKEMKDLDK KIEEKIAQIM KQYGVKRESI WNKEKNAII 
SSSfSS P1DEHKPVGI °HSHSNYELF KPEEGVAKKE 
TNVVNLLKNS TFNNQNFTLA NGQKRVSFSF PPELEKKLGI NMLVKLITPD 

f HLOOPYLP GQTFKYTIAS SSSSS 
TFTVPTSLAY KMASQTIFYP FHAGDTYLRV NPQFAVPKGT DALVRVFDEF 
HGNAYLENNY KVGEIKLPIP KLNQGTTRTA GNKIPVTFMA NAYLDNOSTY 
IVEVPILEKE NQTDKPSILP QFKRNKAQEN SKLDEKVEEP KTS^SS 
SESSi ESSE PVQEKVAKFA ESYGMKLEHV 5SE52 
YLPSGEVIKK NMADFTGEAP QGNGENKPSE KGKVSTGTVE NQPTENKPAD 
SLPEAPNEKP VKPENSTDNG MLNPEGNVGS DPMLDSALEE ApSSJ££ 
LEKFTASYGL GLOSVIFNMD GTIELRLPSG EVIKKNLLIS ^ 
(SEQ ID NO : 16) 840 

FIGURE 19 
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100 
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200 
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^v™™, QEMKDNNRVS WDGSQSSQK SENLTPDQVS QKEGIQAEQI 
VIKITDQGYV TSHGDHYHYY NGKVPYDALF SEELLMKDPN YOLKDADIVN 
EVKGOYIIKV DQKYYVYLKD AAHADNVRTK DEINRQKQEH 
VAVARSQGRY TTNDGYVFNP ADIIEDTGNA YIVPHGGHYH HpSdmSE 
E^AAAKAHLA GKNMQPSQLS YSSTASDNNT QSVAKGSTSK l££££i 
LLKELYDSPS AQRYSESDGL VFDPAKIISR TPNGVAIPHG DHYHFIPYSK 
LSALEEKIAR MVPISGTGST VSTNAKPNEV VSSLGSLSSN PSSLTtSl 
SSASDGYIFN PKDIVEETAT AYIVRHGDHF HYIPKSNQIG QwLSS 
TPSPSLPINP GTSHEKHEED GYGFDANRII AEDESGFVMS 
KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS HEQDYPGNAK EMkSSSe 
EKIAGIMKQY GVKRESIWN KEKNAIIYPH GDfUiHADPID EhS^Shs 
EGVAKKEGNK VYTGEELTNV VNLLKNSTFN SSS£ 
™f5ff™ LEKKLGINML VKLITPDGKV LEKVSGKVFG EGVGNIANFE 
LDQPYLPGQT FKYTIASKDY PEVSYDGTFT VPTSLAYKMA SQTIFYPFHA 
SE^Z 0 FMP « GTDM < VRVFDEFHGN AYLENNYKVG EIKLPIPKLN 
QGTTRTAGNK IPVTFMANAY LDNQSTYIVE VPILEKENQT DKPSILPQFK 
RNKAQENSKL DEKVEEPKTS EKVEKEKLSE TGNSTSNSTL EEVPTV^WQ 
^™ SY GMKLENVLFN MDGTIELYLP SGEVIKKNMA JESSES 
^™fn GK VSTGTVEN0P TENKPADSLP EAPNEKPVKP ENSTDNGMLN 
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CAYALNQHRS QENKDNNRVS YVDGSQSSQK 
VIKITDQGYV TSHGDHYHYY NGKVPYDALF 
EVKGGYIIKV DGKYYVYLKD AAHADNVRTK 
VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 
ELAAAKAHLA GKNMQPSQLS YSSTASDNNT 
LLKELYDSPS AQRYSESDGL VFDPAKIISR 
LSALEEKIAR MVPISGTGST VSTNAKPNEV 
SSASDGYIFN PKDIVEETAT AYIVRHGDHF 
TPSPSLPINP GTSHEKHEED GYGFDANRII 
KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS 
(SEQ ID NO : 56) 



SENLTPDQVS QKEGIQAEQI SO 

SEELLMKDPN YQLKDADIVN 100 

DEINRQKQEH VKDNEKVNSN X50 

YIVPHGGHYH YIPKSDLSAS 200 

QSVAKGSTSK PANKSENLQS 250 

TPNGVAIPHG DHYHFIPYSK 300 

VSSLGSLSSN PSSLTTSKEL 350 

HYIPKSNQIG QPTLPNNSLA 400 

AEDESGFVMS HGDHNHYFFK 450 

HEQOYPGNA 489 
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GSAVIVSLSL CAYALNQHRS QENKDNNRVS YVDGSQSSQK SENLTPDQVS 
QKEGIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF SEELLMKDPK YQlSaDI^ 
EVKGGYIIKV DGKYYVYLKD AAHADNVRTK DEINRQKQEH VKDNEKVNSN S5 

ADIIEDTGNA YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA 2E5SE 240 

YSSTASDNNT QSVAKGSTSK PANKSENLQS LLKELYDSPS AQRYSESDGL VFDPAKIISR Inn 

PS*™"? DHYHFIPYSK LSALEEKIAR MVPISGTGST VSTNaSS ^SSSS III 

SSASDGYIFN PKDIVEETAT AYIVRHGDHF HYIPKSNQIG QPTLPNNSLA 420 

TPSPSLPINP GTSHEKHEED GYGFDANRII AEDESGFVMS HGDHNHYFFK KDLTEEQIKA 4fl0 

AQKHLEEVKT SHNGLDSLSS HEQDYPGNA (SEQ ID NO • 57) KDLTEEQIKA 480 



60 
120 
160 
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DLTEEQIKAA QKHLEEVKTS HNGLDSLSSH EQDYPGNAKE MKDLDKKIEE 
KIAGIMKQYG VKRESIWNK EKNAIIYPHG DHHHADPIDE HKPVGIGHSH 
t w™ KPEE GVAKKEGNKV VTGEELTNW NLLKNSTFNN QNFTLANGQK 
RVSFSFPPEL EKKLGINMLV KLITPDGKVL EKVSGKVFGE GVGNIANFEL 
DQPYLPGQTF KYTIASKDYP EVSYDGTFTV PTSLAYKMAS QTI FYPFHAG 
DTYLRVNPQF AVPKGTDALV RVFDEFHGNA YLENNYKVGE IKLPIPKLNQ 
GTTRTAGNKI PVTFMANAYL DNQSTYIVEV PILEKENQTD KPSILPQFKR 
NKAQENSKLD EKVEEPKTSE KVEKEKLSET GNSTSNSTLE EVPTVDPVQE 
KVAKFAESYG MKLENVLFNM DGTIELYLPS GEVIKKNMAD FTGEAPQGNG 
ENKPSENGKV STGTVENQPT ENKPADSLPE APNEKPVKPE NSTDNGMLNP 
EGNVGSDPML DPALEEAPAV DPVQEKLEKF TASYGLGLDS VIFNMDGTIE 
LRLPSGEVIK KNLSDFIAKL RYRSNHWVPD SRPEEPSPQP TPEPSPSPQP 
APNPQPAPSN PIDEKLVKEA VRKVGDGYVF EENGVSRYIP AKNLSAETAA 
GIDSKLAKQE SLSHKLGAKK TDLPSSDREF YNKAYDLLAR IHQDLLDNKG 700 
RQVDFEALDN LLERLKD VSS DKVKLVDDIL AFLAPIRHPE RLGKPNAQIT 750 
YTDDEIQVAK LAGKYTTEDG YIFDPRDITS DEGDAYVTPH MTHSHWIKKD 800 
SLSEAERAAA QAYAKEKGLT PPSTDHQDSG NTEAKGAEAI YNRVKAAKKV 850 
PLDRMPYNLQ YTVEVKNGSL IIPHYDHYHN IKFEWFDEGL YEAPKGYTLE 900 
DLLATVKYYV EHPNERPHSD NGFGNASDHV QRNKNGQADT NQTEKPSEEK 950 
PQTEKPEEET PREEKPQSEK PESPKPTEEP EEESPEESEE PQVETEKVEE 1000 
KLREAEDL^G KIQDPIIKSN AKETLTGLKN NLLFGTQDNN TIMAEAEKLL 1050 
ALLKESK (SEQ ID NO : 58) 

FIGURE 23 
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CAYALNQHRS QENKDNNRVS YVDGSQSSQK 
VIKITDQGYV TSHGDHYHYY NGKVPYDALF 
EVKGGYIIKV DOKYYVYLKD AAHADNVRTK 
VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 
ELAAA (SEQ ID NO : 59) 



SENLTPDQVS QKEGIQAEQI SO 
SEELLMKDPN YQLKDADIVN 100 
DEINRQKQEH VKDNEKVNSN 150 
YIVPHGGHYH YIPKSDLSAS 200 

205 

FIGURE 24 



CAYELGLHQA QTVKENNRVS YIDGKQATQK TENLTPDEVS KREGINAEQI 50 

VIKITDQGYV TSHGDHYHYY NGKVP YDAI I SEELLMKDPN YQLKDSDIVN 100 

EIKGGYVIKV NGKYYVYLKD AAHADNVRTK EEINRQKQEH SQHREGGTSA ISO 

NDGAVAFARS QGRYTTDDGY IFNASDIIED TGDAYIVPHG DHYHYIPKNE 200 

LSASELAAAE AFL5GRENLS NLRTYRRQNS DNTPRTNWVP SVSNPGTTNT 250 

NTSNNSNTNS QASQSNDIDS LLKQLYKLPL SQRHVESDGL IFDPAQITSR 300 

TARGVAVPHG NHYHFIPYEQ MSELEKRIAR IIPLRYRSNH WVPDSRPEEP 350 

SPQPTPEPSP SPQPAPNPQP APSNPIDEKL VKEAVRKVGD GYVFEENGVS 400 

RYIPAKNLSA ETAAGIDSKL AKQESLSHKL GAKKTDLPSS DREFYNKAYD 450 

LLARIHQDLL DNKGRQVDFE ALDNLLERLK DVSSDKVKLV DDILAFLAPI 500 

RHPERLGKPN AQITYTDDEI QVAKLAGKYT TEDGYIFDPR DITSDEGDAY 550 

VTPHMTHSHW IKKDSLSEAE RAAAQAYAKE KGLTPPSTDH QDSGNTEAKG 600 

AEAIYNRVKA AKKVPLDRMP YNLQYTVEVK NGSLIIPHYD HYHNIKFEWF 650 

DEGLYEAPKG YTLEDLLATV KYYVEHPNER PHSDNGFGNA SDHVQRNKNG 700 

QADTNQTEKP SEEKPQTEKP EEETPREEKP QSEKPESPKP TEEPEEESPE 7 SO 

ESEEPQVETE KVEEKLREAE DLLGKIQDPI IKSNAKETLT GLKNNLLFGT 800 

QDNNTIMAEA EKLLALLKES K { (SEQ ID NO : 60) 821 

FIGURE 25 



CAYELGLHQA QTVKENNRVS YIDGKQATQK 
VIKITDQGYV TSHGDHYHYY NGKVP YDAI I 
EIKGGYVIKV NGKYYVYLKD AAHADNVRTK 
NDGAVAFARS QGRYTTDDGY IFNASDIIED 
LSASELAAAE AFLSGRENLS NLRTYRRQNS 
NTSNNSNTNS QASQSNDIDS LLKQLYKLPL 
TARGVAVPHG NHYHFIPYEQ MSELEKRIAR 
(SEQ ID NO : 61) 



TENLTPDEVS KREGINAEQI 50 

SEELLMKDPN YQLKDSDIVN 100 

EEINRQKQEH SQHREGGTSA 150 

TGDAYIVPHG DHYHYIPKNE 200 

DNTPRTNWVP SVSNPGTTNT 250 

SQRHVESDGL IFDPAQITSR 300 

IIPL 334 

FIGURE 26 



RYRSNHWVPD SRPEEPSPQP TPEPSPSPQP 
VRKVGDGYVF EENGVSRYIP AKNLSAETAA 
TDLPSSDREF YNKAYDLLAR IHQDLLDNKG 
DKVKLVDDIL AFLAPIRHPE RLGKPNAQIT 
YIFDPRDITS DEGDAYVTPH MTHSHWIKKD 
PPSTDHQDSG NTEAKGAEAI YNRVKAAKKV 
IIPHYDHYHN IKFEWFDEGL YEAPKGYTLE 
NGFGNASDHV QRNKNGQADT NQTEKPSEEK 
PESPKPTEJEP EEESPEESEE PQVETEKVEE 
AKETLTGLKN NLLFGTQDNN TIMAEAEKLL 
(SEQ ID NO : 62) 



APNPQPAPSN PIDEKLVKEA 50 

GIDSKLAKQE SLSHKLGAKK 100 

RQVDFEALDN LLERLKDVSS ISO 

YTDDEIQVAK LAGKYTTEDG 200 

SLSEAERAAA QAYAKEKGLT 250 

PLDRMPYNLQ YTVEVKNGSL 300 

DLLATVKYYV EHPNERPHSD 350 

PQTEKPEEET PREEKPQSEK 400 

KLREAEDLLG KIQDPIIKSN 450 

ALLKESK 487 

FIGURE 27 
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AEAFLSGREN LSNLRTYRRQ NSDNTPRTNW VPSVSNPGTT NTNTSNNSNT 50 

NSQASQSNDI DSLLKQLYKL PLSQRHVESD GLIFDPAQIT SRTARGVAVP 100 

HGNHYHFIPY EQMSELEKRI ARIIPLRYRS NHWVPDSRPE EPSPQPTPEP ISO 

SPSPQPAPNP QPAPSNPIDE KLVKEAVRKV GDGYVFEENG VSRYIPAKNL 200 

SAETAAGIDS KLAKQESLSH KLGAKKTDLP SS DREFYNKA YDLLARIHQD 250 

LLDNKGRQVD FEALDNLLER LKDVSSDKVK LVDDILAFLA PIRHPERLGK 300 

PNAQITYTDD EIQVAKLAGK YTTEDGYIFD PRDITSDEGD AYVTPHMTHS 350 

HWIKKDSLSE AERAAAQAYA KEKGLTPPST OHQOSGNTEA KGAEAIYNRV 400 

KAAKKVPLDR MPYNLQYTVE VKNGSLIIPH YDHYHNIKFE WFDEGLYEAP 450 

KGYTLEDLLA TVKYYVEHPN ERPHSDNGFG NASDHVQRNK NGQADTNQTE 500 

KPSEEKPQTE KPEEETPREE KPQSEKPESP KPTEEPEEES PEESEEPQVE 550 

TEKVEEKLRE AEDLLGKIQD PIIKSNAKET LTGLKNNLLF GTQDNNTIMA 600 

EAEKLLALLK ESK (SEQ ID NO : 63) 613 

FIGURE 28 



DLTEEQIKAA QKHLEEVKTS HNGLDSLSSH EQDYPGNAKE MKDLDKKIEE so 

KIAGIMKQYG VKRESIWNK EKNAIIYPHG DHHHADPIDE HKPVGIGHSH 100 

SNYELFKPEE GVAKKEGNKV YTGEELTNW NLLKNSTFNN QNFTLANGQK l 50 

RVSFSFPPEL EKKLGINMLV KLITPDGKVL EKVSGKVFGE GVGNIANFEL 200 

DQPYLPGQTF KYTIASKDYP EVSYDGTFTV PTSLAYKMAS QTIFYPFHAG 250 

DTYLRVNPQF AVPKGTDALV RVFDEFHGNA YLENNYKVGE IKLPIPKLNQ 300 

GTTRTAGNKI PVTFMANAYL DNQSTYIVEV PILEKENQTD KPSILPQFKR 3 50 

NKAQENSKLD EKVEEPKTSE KVEKEKLSET GNSTSNSTLE EVPTVDPVQE 400 

KVAKFAESYG MKLENVLFNM DGTIELYLPS GEVIKKNMAD FTGEAPQGNG 450 

ENKPSENGKV STGTVENQPT ENKPADSLPE APNEKPVKPE NSTDNGMLNP S00 

EGNVGSDPML DPALEEAPAV DPVQEKLEKF TASYGLGLDS VIFNMDGTIE 550 

LRLPSGEVIK KNLSDFIA (SEQ ID NO : 64) 56B 

FIGURE 29 



DLTEEQIKAA QKHLEEVKTS HNGLDSLSSH EQDYPGNAKE MKDLDKKIEE 50 

KIAGIMKQYG VKRESIWNK EKNAIIYPHG DHHHADPIDE HKPVGIGHSH 100 

SNYELFKPEE GVAKKEGNKV YTGEELTNW NLLKNSTFNN QNFTLANGQK 150 

RVSFSFPPEL EKKLGINMLV KLITPDGKVL EKVSGKVFGE GVGNIANFEL 200 

DQPYLPGQTF KYTIASKDYP EVSYDGTFTV PTSLAYKMAS QTIFYPFHAG 250 

DTYLRVNPQF AVPKGTDALV RVFDEFHGNA YLENNYKVGE IKLPIPKLNQ 300 

GTTRTAGNKI PVTFMANAYL DNQSTYIVE (SEQ ID NO : 65) 329 

FIGURE 3 0 
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EVPILEKENQ TDKPS I LPQF KRNKAQENSK 
ETGNSTSNST LEEVPTVDPV QEKVAKFAES 
PSGEVIKKNM ADFTGEAPQG NGENKPSENG 
PEAPNEKPVK PENSTDNGML NPEGNVGSDP 
KFTASYGLGL DSVIFNMOGT IELRLPSGEV 
(SEQ ID HO : 66) 



LOEKVEEPKT SEKVEKEKLS 50 

YGMKLENVLF NMDGTIELYL 100 

KVSTGTVENQ PTENKPAJDSL 150 

MLDPALEEAP AVDPVQEKLE 200 

IKKNLSDFIA 240 

FIGURE 31 



DIDSLLKQLY KLPL5QRKVE SDGLIFDPAQ 
PYEQMSELEK RIARIIPLRY RSNHWVPDSR 
NPQPAPSNPI DEKLVKEAVR KVGDGYVFEE 
DSKLAKQESL SHKLGAKKTD LPSSDREFYN 
VDFEALDNLL ERLKDVSSDK VKLVDDILAF 
DDEIQVAKLA GKYTTEDGYI FDPRDITSDE 
SEAERAAAQA YAKEKGLTPP STDHQDSGNT 
DRMPYNLQYT VEVKNGSLII PHYDHYHNIK 
LATVKYYVEH PNERPHSDNG FGNASDHVQR 
TEKPEEETPR EEKPQSEKPE SPKPTEEPEE 
REAEDLLGKI QDPIIKSKAK ETLTGLKNNL 
LKESK (SEQ ID NO : 67) 



ITSRTARGVA VPHGNHYHFI 50 

PEEPS PQPTP EPS PS PQ PAP 100 

NGVSRYIPAK NLSAETAAGI 150 

KAYDLLARIH QDLLDNKGRQ 200 

LAP I RH PERL GKPNAQITYT 250 

GDAYVTPHMT HSHWIKKDSL 300 

EAKGAEAIYN RVKAAKKVPL 350 

FEWFDEGLYE APKGYTLEDL 400 

NKNGQADTNQ TEKPSEEKPQ 450 

ESPEESEEPQ VETEKVEEKL 500 

LFGTQDNNTI MAEAEKLLAL 550 

555 

FIGURE 32 



DIDSLLKQLY KLPLSQRHVE SDGLIFDPAQ 
PYEQMSELEK RIARIIPLRY RSNHWVPDSR 
NPQPAPSNPI DEKLVKEAVR KVGDGYVFEE 
DSKLAKQESL SHKLGAKKTD LPSSDREFYN 
VDFEALDNLL ERLKDVSSDK VKLVDDILAF 
DDEIQVAKLA GKYTTEDGYI FDPRDITSDE 
SEAERAAAQA YAKEKGLTPP STDHQDSGNT 
DRMPYNLQYT VEVKNGSLII PHYDHYHNIK 
LATVKYYVEH PNERPHSDNG FGNASDHV 



ITSRTARGVA VPHGNHYHFI 50 

PEEPSPQPTP EPSPSPQPAP 100 

NGVSRYIPAK NLSAETAAGI ISO 

KAYDLLARIH QDLLDNKGRQ 200 

LAPIRHPERL GKPNAQITYT 250 

GDAYVTPHMT HSHWIKKDSL 300 

EAKGAEAIYN RVKAAKKVPL 350 

FEWFDEGLYE APKGYTLEDL 400 

(SEQ ID NO : 68) 428 
FIGURE 33 



GLYEAPKGYT LEDLLATVKY YVEHPNERPH SDNGFGNASD HVQRNKNGQA 
DTNQTEKPSE EKPQTEKPEE ETPREEKPQS EKPESPKPTE EPEEESPEES 
EEPQVETEKV EEKLREAEDL L (SEQ ID NO : 69) 

FIGURE 34 



ASDHVQRNKN GQADTNQTEK PSEEKPQTEK PEEETPREEK PQSEKPESPK 
PTEEPEEESP EESEEPQVET EKVEEKLREA EDLLGKIQDP IIKSNAKETL 
TGLKNNLLFG TQDNNTIMAE AEKLLALLKE SK 
(SEQ ID NO : 70) 

, FIGURE 35 
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DIDSLLKQLY KLPLSQRHVE SDGLIFDPAQ ITSRTAROVA VPHGNHYHFI 
PYEQMSELEK RIARIIPLRY RSNHWVPDSR PEEPSPQPTP EPSPSPQPAP 
NPQPAPSNPI DEKLVKEAVR KVGDGYVFEE NGVSRYIPAK NLSAETAAGI 
DSKLAKQESL SHKLGAKKTD LPSSDREFYN KAYDLLARIH QDLLDNKGRQ 
VDFEALDNLL ERLKDVSSDK VKLVDD (SEQ ID NO : 71) 

FIGURE 36 



DILAFIAPIR HPERLGKPNA QITYTDDEIQ VAKLAGKYTT EDGYIFDPRD 
ITSDEGDAYV TPHMTHSHWI KKDSLSEAER AAAQAYAKEK GLTPPSTDHQ 
DSGNTEAKGA EAIYNRVKAA KKVPLDRMPY NLQYTVEVKM GSLIIPHYDH 
YHNIKFEWFD EGLYEAPKGY TLEDLLATVK YYVEHPNERP HSDNGFGNAS 
DHV (SEQ ID NO : 72) 

FIGURE 37 



CSYELGRHQA GQVKKESNRV SYIDGDQAGQ KAENLTPDEV SKREGINAEQ 
IVIKITDQGY VTSHGDHYHY YNGKVPYDAI ISEELLMKDP NYQLKDSDIV 
NEIKGGYVIK VDGKYYVYLK DAAHADNIRT KEEIKRQKQE HSHNHNSRAD 
NAVAAARAQG RYTTDDGYIF NASDIIEDTG DAYIVPHGDH YHYIPKNELS 
ASELAAAEAY WNGKQGSRPS SSSSYNANPV QPRLSENHNL TVTPTYHQNQ 
GENISSLLRE LYAKPLSERH VESDGLIFDP AQITSRTARG VAVPHGNHYH 
FIPYEQMSEL EKRIARIIPL RYRSNHWVPD SRPEQPSPQS TPEPSPSLQP 
APNPQPAPSN PIDEKLVKEA VRKVGDGYVF EENGVSRYIP AKDLSAETAA 
GIDSKLAKQE SLSHKLGAKK TDLPSSDREF YNKAYDLLAR IHQDLLDNKG 
RQVDFEVLDN LLERLKDVSS DKVKLVDDIL AFLAPIRHPE RLGKPNAQIT 
YTDDEIQVAK LAGKYTTEDG YIFDPRDITS DEGDAYVTPK MTHSHWIKKD 
SLSEAERMA QAYAKEKGLT PPSTDHQDSG NTEAKGAEAI YNRVKAAKKV 
PLDRMPYNLQ YTVEVKNGSL IIPHYDHYHN IKFEHFDEGL YEAPKGYSLE 
DLLATVKYYV EHPNERPHSD NGFGNASDHV RKNKADQDSK PDEDKEHDEV 
SEPTHPESDE KENHAGLNPS ADNLYKPSTD TEETEEEAED TTDEAEIPQV 
ENSVINAKIA DAEAILEKVT DPSIRQNAME TLTGLKSSLL LGTKDNNTIS 
AEVDSLLALL KESQPAPIQ (SEQ ID NO : 73) 

FIGURE 38 



ENISSLLREL YAKPLSERHV ESDGLIFDPA QITSRTARGV AVPHGNHYHF 

IPYEQMSELE KRIARIIPLR YRSNHWVPDS RPEQPSPQST PEPSPSLQPA 

PNPQPAPSNP IDEKLVKEAV RKVGDGYVFE ENGVSRYIPA KDLSAETAAG 

IDSKLAKQES ISHKLGAKKT DLPSSDREFY NKAYDLLARI HQDLLDNKGR 

QVDFEVLDNL LERLKDVSSD KVKLVDDILA FLAPIRHPER LGKPNAQITY 

TDDEIQVAKL AGKYTTEDGY IFDPRDITSD EGDAYVTPHM THSHHIKKDS 

LSEAERAAAQ AYAKEKGLTP PSTDHQDSGN TEAKGAEAIY NRVKAAKKVP 

LDRMPYNLQY TVEVKNGSLI IPHYDHYHNI KFEWFDEGLY EAPKGYSLED 

LLATVKYYVE HPNERPHSDN GFGNASDHVR KNKADQDSKP DEDKEHDEVS 
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VRKNKADQDS KPDEDKEHDE VSEPTHPESD EKENHAGLNP SADNLYKPST 
DTEETEEEAE DTTDEAEIPQ VENSVINAKI ADAEALLEKV TDPSIRQNAM 
ETLTGLKSSL LLGTKDNNTI SAEVDSLLAL LKESQPAPIQ 
(SEQ ID NO : 75) 

FIGURE 4 0 



GACTTGACAG AAGAGCAAAT TAAGGCTGCG 
TAAAACTAGT CATAATGGAT TAGATTCTTT 
ATCCAGGTAA TGCCAAAGAA ATGAAAGATT 
AAAATTGCTG GCATTATGAA ACAATATGGT 
CGTGAATAAA GAAAAAAATG CGATTATTTA 
ATGCAGATCC GATTGATGAA CATAAACCGG 
AGTAACTATG AACTGTTTAA ACCCGAAGAA 
GAATAAAGTT TATACTGGAG AAGAATTAAC 
AAAATAGTAC GTTTAATAAT CAAAACTTTA 
CGCGTTTCTT TTAGTTTTCC GCCTGAATTG 
TATGCTAGTA AAATTAATAA CACCAGATGG 
CTGGTAAAGT ATTTGGAGAA GGAGTAGGGA 
GATCAACCTT ATTTACCAGG ACAAACATTT 
AGATTATCCA GAAGTAAGTT ATGATGGTAC 
TAGCTTACAA AATGGCCAGT CAAACGATTT 
GATACTTATT TAAGAGTGAA CCCTCAATTT 
TGCTTTAGTC AGAGTGTTTG ATGAATTTCA 
ATAACTATAA AGTTGGTGAA ATCAAATTAC 
GGAACAACCA GAACGGCCGG AAATAAAATT 
TGCTTATTTG GACAATCAAT CGACTTATAT 
AAAAAGAAAA TCAAACTGAT AAACCAAGTA 
AATAAAGCAC AAGAAAACTC AAAACTTGAT 
GACTAGTGAG AAGGTAGAAA AAGAAAAACT 
CTAGTAATTC AACGTTAGAA GAAGTTCCTA 
AAAG TAGCAA AATTTGCTGA AAGTTATGGG 
GTTTAATATG GACGGAACAA TTGAATTATA 
TTAAAAAGAA TATGGCAGAT TTTACAGGAG 
GAAAATAAAC CATCTGAAAA TGGAAAAGTA 
CCAACCAACA GAAAATAAAC CAGCAGATTC 
AAAAACCTGT AAAACCAGAA AACTCAACGG 
GAAGGGAATG TGGGGAGTGA CCCTATGTTA 
TCCAGCAGTA GATCCTGTAC AAGAAAAATT 
ACGGATTAGG CTTAGATAGT GTTATATTCA 
TTAAGATTGC CAAGTGGAGA AGTGATAAAA 
AGCGAAGCTT CGTTATCGTT CAAACCATTG 
AAGAACCAAG TCCACAACCG ACTCCAGAAC 
GCACCAAATC CTCAACCAGC TCCAAGCAAT 
CAAAGAAGCT GTTCGAAAAG TAGGCGATGG 
GAGTTTCTCG TTATATCCCA GCCAAGAATC 
GGCATTGATA GCAAACTGGC CAAGCAGGAA 
AGCTAAGAAA ACTGACCTCC CATCTAGTGA 
CTTATGApTT ACTAGCAAGA ATTCACCAAG 
CGACAAGTTG ATTTTGAGGC TTTGGATAAC 
TGTCTCAAGT GATAAAGTCA AGTTAGTGGA 
CTCCGATTCG TCATCCAGAA CGTTTAGGAA 
TACACTGATG ATGAGATTCA AGTAGCCAAG 
AGAAGACGGT TATATCTTTG ATCCTCGTGA 
ATGCCTATGT AACTCCACAT ATGACCCATA 



CAAA AAPATT 


Thfzhnnh Af2*r 


50 


V X Ull V— X Wll 


fz A A r» a nn a t*t 


100 


inuHiivwin 


A A *V^r* A A O A IV 


150 


GTCTAAAPfiTfi 


AAAGTATTGT 


200 






250 


TTfif? A A TTnn 


TCATTCTCAC 


300 


UUAUl Xf\ 


A A A A IP A Artf* 


350 


fill TrtTTPTT 


AAl 1 iviT AA 


400 


\mX\m x AwwUAA 


TGGTCAAAAA 


450 




TAGGTATCAA 


500 


AnAAulAl lu 


GAGAAAGTAT 


550 


A T A T*TY2 P A A A 


CTTTGAATTA 


600 


AAf5TATAf*TA 




650 


r\X X Xn\**\\3 X 1 


CCAACCTCTT 


700 


T CTATC CTTT 


C CATG CAGGG 


750 


« k-Jv*x\» \» WTA 


AAGGAACTGA 


800 


1 uUAAATU L, 1 


TATTTAGAAA 


650 


CGATTCCGAA 


ATTAAACCAA 


900 


CCTGTAACCT 


TCATGGCAAA 


950 


TGTGGAAGTA 


CCTATCTTGG 


1000 


TTCTACCACA 


ATTTAAAAGG 


1050 


GAAAAGGTAG 


AAGAACCAAA 


1100 


TTCTGAAACT 


GGGAATAGTA 


1150 


CAGTGGATCC 


TGTACAAGAA 


1200 


ATGAAGCTAG 


AAAATGTCTT 


1250 


TTTACCATCA 


GGAGAAGTCA 


1300 


AAGCACCTCA 


AGGAAATGGT 


1350 


TCTACTGGAA 


CAGTTGAGAA 


1400 


TTTACCAGAG 


GCACCAAACG 


1450 


ATAATGGAAT 


GTTGAATCCA 


1500 


GATCCAGCAT 


TAGAGGAAGC 


1550 


AGAAAAATTT 


ACAG CTAGTT 


1600 


ATATGGATGG 


AACGATTGAA 


1650 


AAGAATTTAT 


CTGATTTCAT 


1700 


GGTACCAGAT 


TCAAGACCAG 


1750 


CTAGTCCAAG 


TCCGCAACCT 


1800 


CCAATTGATG 


AGAAATTGGT 


1850 


TTATGTCTTT 


GAGGAGAATG 


1900 


TTTCAGCAGA 


AACAGCAGCA 


1950 


AGTTTATCTC 


ATAAGCTAGG 


2000 


TCGAGAATTT 


TACAATAAGG 


2050 


ATTTACTTGA 


TAATAAAGGT 


2100 


CTGTTGGAAC 


GACTCAAGGA 


2150 


TGATATTCTT 


GCCTTCTTAG 


2200 


AACCAAATGC 


GCAAATTACC 


2250 


TTGGCAGGCA 


AGTACACAAC 


2300 


TATAACCAGT 


GATGAGGGGG 


2350 


GCCACTGGAT 


TAAAAAAGAT 


2400 
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AGTTTGTCTO AAGCTGAGAG AGCGGCAGCC 
AGGTTTGACC CCTCCTTCGA CAGACCATCA 
CAAAAGGAGC AG AAG CTATC TACAACCGCG 
CCACTTGATC GTATGC CTT A CAATCTTCAA 
CGGTAGTTTA ATCATACCTC ATTATGACCA 
AGTGGTTTGA CGAAGGCCTT TATGAGGCAC 
GATCTTTTGG CGACTGTCAA GTACTATGTC 
GCATTCAGAT AATGGTTTTG GTAACGCTAG 
AAAATGGTCA AGCTGATACC AATCAAACGG 
CCTCAGACAG AAAAACCTGA GGAAGAAACC 
AAGCGAGAAA CCAGAGTCTC CAAAACCAAC 
CACCAGAGGA ATCAGAAGAA CCTCAGGTCG 
AAACTGAGAG AGGCTGAAGA TTTACTTGGA 
CAAGTCCAAT GCCAAAGAGA CTCTCACAGG 
TTGGCACCCA GGACAACAAT ACTATTATGG 
GCTTTATTAA AGGAGAGTAA G (SEQ II 



CAGGCTTATG 


CTAAAGAGAA 


2450 


GGATTCAGGA 


AATACTGAGG 


2500 


TGAAAGCAGC 


TAAGAAGGTG 


2550 


TATACTGTAG 


AAGTCAAAAA 


2600 


TTACCATAAC 


ATCAAATTTG 


2650 


CTAAGGGGTA 


TACTCTTGAG 


2700 


GAACATCCAA 


ACGAACGTCC 


2750 


CGACCATGTT 


CAAAGAAACA 


2800 


AAAAACCAAG 


CGAGGAGAAA 


2650 


CCTCGAGAAG 


AGAAACCACA 


2900 


AGAGGAACCA 


GAAGAAGAAT 


2950 


AGACTGAAAA 


GGTTGAAGAA 


3000 


AAAATCCAGG 


ATCCAATTAT 


3050 


ATTAAAAAAT 


AA Ail ACT AT 


3100 


CAGAAGCTGA 


AAAACTATTG 


3150 


► NO : 76) 




3171 



FIGURE 41 



EAYWNGKQGS RPSSSSSYNA NPVQPRLSEN 
LRELYAKPLS ERHVESDGLI FDPAQITSRT 
SELEKRIARI IPLRYRSNHW VPDSRPEQPS 
PSNPIDEKLV KEAVRKVGDG YVFEENGVSR 
KQESLSHKLG AKKTDLPSSD REFYNKAYDL 
LDNLLERLiKD VSSDKVKLVD D I LAP LAP I R 
VAKLAGKYTT EDGYIFDPRD ITSDEGDAYV 
AAAQAYAKEK GLTPPSTDHQ DSGNTEAKGA 
NLQYTVEVKN GSLIIPHYDH YHNIKFEWFD 
YYVEHPNERP HSDNGFGNAS DHV (SEQ 



HNLTVTPTYH QNQGENISSL 50 

ARGVAVPHGN HYHFIPYEQM 100 

PQSTPEPSPS LQPAPNPQPA 150 

YIPAKDLSAE TAAGIDSKLA 200 

LARIHQDLLD NKGRQVDFEV 250 

HPERLGKPNA QITYTDDEIQ 300 

TPHMTHSHWI KKDSLSEAER 350 

EAIYNRVKAA KKVPLDRMPY 400 

EGLYEAPKGY SLEDLIATVK 450 

ID NO : 77) 473 
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CAYALNQHRS QENKDNNRVS YVDGSQSSQK SENLTPDQVS QKEGIQAEQI 50 
VIKITDQGYV TSHGDHYHYY NGKVPYDALF SEELLMKDPN YQLKDADIVN 100 
EVKGGYIIKV DGKYYVYLKD AAHADNVRTK OEIKRQKQEH VKDNEKVNSN 150 
VAVARSQGRY TTNDGYVFNP ADIIEDTGNA YIVPHGGHYH YIPKSDLSAS 200 
EIAAAKAHLA GKNMQPSQLS YSSTASDNNT QSVAKGSTSK PANKSENLQS 2 SO 

LLKELYDSPS AQRYSESDGL VFDPAKIISR TPNGVAIPHG DHYHFIPYSK 300 
LSALEEKIAR MVPISGTGST VSTNAKPNEV VSSLGSLSSN PSSLTTSKEL 
SSASDGYIFN PKDIVEETAT AYIVRHGDHF HYIPKSNQIG QPTLPNNSLA 
TPSPSLPINP GTSHEKHEED GYGFDANRII AEDESGFVMS HGDHNHYFFK 
KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS HEQDYPGNAK EMKDLDKKIE 
EKIAGIMKQY OVKRESIWN KEKNAIIYPH GDHHHADPID EHKPVGIGHS 550 
HSNYELFKPE EGVAKKEGNK VYTGEELTNV VNLLKNSTFN NQNFTLANGQ 600 
KRVSFSFPPE LEKKLGINMIi VKLITPDGKV LEKVSGKVFG EGVGNIANFE 650 
LOQPYLPGQT FKYTIASKDY PEVSYDGTFT VPTSLAYKMA SQTIFYPFHA 700 
GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN AYLENNYKVG EIKLPIPKLN 
QGTTRTAGNK IPVTFMANAY LDNQSTYIVE (SEQ ID NO : 78) 

FIGURE 43 
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CAYELGLHQA QTVKENNRVS YIDGKQATQK TENLTPDEVS KREGINAEQI 50 

VIKITDQGYV TSHGDHYHYY NGKVPYDAII SEELLMKDPN YQLKDSDIVN 100 

EIKGGYVIKV NGKYYVYLKD AAHADNVRTK EEINRQKQEH SQHREGGTSA 150 

NDGAVAFARS QGRYTTDDGY IFNASDIIED TGDAYIVPHG DHYHYIPKNE 200 

LSASELAAAE AFLSGRENLS NLRTYRRQNS DNTPRTNWVP SVSNPGTTNT 250 

NTSNNSNTNS QASQSNDIDS LLKQLYKLPL SQRHVESDGL IFDPAQITSR 300 

TARGVAVPHG NHYHFIPYEQ MSELEKRIAR IIPLRYRSNH WVPDSRPEEP 350 

SPQPTPEPSP SPQPAPNPQP APSNPIDEKL VKEAVRKVGD GYVFEENGVS 400 

RYIPAKNLSA ETAAGIDSKL AKQESLSHKL GAKKTDLPSS DREFYNKAYD 450 

LLARIHQDLL DNKGRQVDFE ALDNLLERLK DVSSDKVKLV DDILAFLAPI 500 

RHPERLGKPN AQITYTDDEI QVAKLAGKYT TEDGYIFDPR DITSDEGDAY 550 

VTPHMTHSHW IKKDSLSEAE RAAAQAYAKE KGLTPPSTDH QDSGNTEAKG 600 
AEAIYNRVKA AKKVPLDRMP YNLQYTVEVK NGSLIIPHYD HYHNIKFEWF 
DEGLYEAPKG YTLEDLLATV KYYVEHPNER PHSDNGFGNA 
(SEQ ID NO : 79) 

FIGURE 44 
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GTGAAGAAAA 
GGAAGTTACC 
TATATTGATG 
ATCAGTGCTG 
TATGTGACCT 
ATTATTAGTG 
ATCAATGAAA 
AAGCCAGGTA 
AAAGGAACTA 
GAAGTTGCGG 
ATTTTTAGTC 
AATCACTATC 
GCCTACTGGA 
GCCCCAGGTC 
CAGCCAGATA 
AAACACCAAA 
CGTCTTGATT 
GTGATCAAAT 
CCAAGAAGTC 
ACTGAGGACA 
ACCTTTCTTG 
ACGAGTGATG 
ACAGCTAAAC 
TTGGATGAGG 
TTGGATCAGG 
GTAACAAAAG 
GCTCGTGATC 
GATAAGAAGC 
GATGTGTCAA 
GTTATCCCAC 
ATTGCAACAG 
CCAGGGCATG 
GGTATGCCAA 
GGTCGTTTTG 
ACTTTTGTAT 
ACCATTAATA 
AAGAAAAATA 
AAGAGCAATG 
TTTATAGACA 
CAATTAGCAC 
CAAT^TTATA 
AACCCTTAA 



CATATGGTTA TATCGGCTCA 
AACTTGGTAA GCATCATATG 
ACAGCAAAGG TAAGG CAAAA 
AAGAAGGCAT CTCTGCTGAA 
CACACGGTGA CCATTATCAT 
AAGAGTTGTT GATGACGGAT 
TCTTAGACGG TTACGTTATT 
GTAAGCGCAA AAACATTCGA 
AAGAAGCTAA AGAAAAAGGT 
CAGTCAATGA AGCAAAAAGA 
CGACAGATAT CATTGATGAT 
ATTATATTCC TAAAAAGGAT 
GTCAAAAACA AGGTCGAGGT 
GTAGGAAAGC CCCAATTCCT 
ACGGTGGCTA TCATCCAGCG 
GAGATGAGTT TAAAGGAAAA 
TGAAATACCG TCATGTGGAA 
CAAACGCTTT TGGGTATGTG 
AGTTATCACC TCTTGAAATG 
ATGACTCAGG TTCAGAGCAC 
GTCATCGCAT CAAAGCTTAC 
CTTATG TTTT TAGTAAAGAA 
ACGGAGATCA TTTCCACTAT 
TCGCTAACTG GGTGAAAGCA 
AACAAGGCAA AGAAAAACCA 
ATGGTAAAGT GGGCTATATG 
AACTTGATTT GACTCAGATT 
ATTACCGTTA TGACATTGTT 
GTCTGCCGAT GCATGCTGGT 
ATATTGATCA TATCCATGTC 
TCAAGTATGT GATGCAACAC 
AAGAGTCAGG TTCGGTCATT 
ACTGGCAAAT TATCCATTCT 
CAACACCAGA CGG CTATATT 
GGAAAGATGG CTCCTTTAGC 
AATCTGATCT ATCCCAAGCT 
CTGGTGATGC TACTGATACG 
AAAAC CAACA GCCAAGTGAA 
GTTTACCAGA CTATGGTCTA 
AAAAAGCTAA TATCGATCCT 
ATAAAAATGG TGAATTGGTA 
(SEQ ID NO : 60) 



GTTGCTGCCA 
GGTCTAGCAA 
GCCCCTAAAA 
CAGATCGTAG 
TTTTACAATG 
CCTAATTACC 
AAAGTCAATG 
ACCAAACAAC 
TTAGCTCAAG 
CAAGGACGCT 
TTAGGAGATG 
TTGTCTCCAA 
GCTAGACCGT 
GATGTGACGC 
CCTCCTAGGC 
ACCTTTAAGG 
GAAGATGGGT 
GTGCCTCATG 
GAATTAGCAG 
TCAAAACCAT 
GGAAAAGGCT 
TCCATTCATT 
ATAGGATTTG 
AAAGGTCAAG 
CTCTTTGACA 
ATG CCAAAAG 
GCCTTTGCCG 
GACACAGGTA 
AATGCTACTT 
GTTCCGTATT 
CCCGAAGTTC 
CCAAATGTTA 
GCTGAAGAAG 
TTCGATCCAC 
ATCCCAAGAG 
GAGTGGCAAC 
GATAAACCCA 
GCCAGTAAAG 
GAT AG AG C AA 
AAGTATCTCA 
ACTTATGATA 



TTTTACTAGC 
CAAAGGACAA 
CAAACAAAAC 
TCAAAATTAC 
GGAAAGTTCC 
GTTTTAAACA 
GCAACTATTA 
AAATTGCTGA 
TGGCCCATCT 
ATACTACAGA 
CTTATTTAGT 
GTGAGCTAGC 
CTGATTACCG 
CTAACCCTGG 
CAAATGATGC 
AACTTTTAGA 
TGATTTTTGA 
GAGATCATTA 
ATCGATACTT 
CAGATAAAGA 
TAGATGGTAA 
CAGTGGATAA 
GAGAACTTGA 
CTGATGAGCT 
CTAAAAAAGT 
ATGGTAAGGA 
AACAAGAACT 
TTGAGCCACG 
ACGATACTGG 
CATGGTTGAC 
GTCCGGATGT 
CGCCTCTTGA 
TTCAAAAAGC 
GAGATGTTTT 
CAGATGGCAG 
AAGCTCAAGA 
AAGAAAAGCA 
AAGAAAAAGA 
CCCTAGAAGA 
TTTTCCAACC 
TCAAGACACT 



TACTCATATT 
TCAGATTGCC 
GATGGATCAA 
TGACCAAGGC 
TTATGATGCG 
ATCAGACGTT 
TGTTTACCTC 
GCAAGTAGCC 
CAGTAAAGAA 
CGATGGCTAT 
ACCTCATGGT 
TGCTGCACAA 
CCCGACACCA 
ACAAGGTCAT 
GTCACAAAAC 
TCAACTACAC 
ACCGACTCAA 
TCATATTATC 
AGCTGGCCAA 
AGTGACACAT 
ACCATATGAT 
ATCAGGAGTT 
ACAATATGAG 
TGCTGCTGCT 
GAGTCGCAAA 
CTATTTCTAT 
AATGCTTAAA 
ACTTGCTGTA 
AAGTTCGTTT 
GCGCGATCAG 
ATGGTCTAAG 
TAAACGTGCT 
CCTAGCAGAA 
GGCCAAAGAA 
TTCATTGAGA 
GTTATTGGCA 
ACAGGCAGAT 
ATCAGATGAC 
TCATATCAAT 
AGAAGGTGTC 
TCAACAAATA 



60 
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240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
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1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
I960 
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VKKTYGYIGS VAAILLATHI GSYQLGKHHM GLATKDNQIA YIDDSKGKAK 50 

APKTNKTMDQ ISAEEGISAE QIWKITDQG YVTSHGDHYH FYNGKVPYDA 100 

IISEELLMTD PNYRFKQSDV INEILDGYVI KVNGNYYVYL KPGSKRKNIR ISO 

TKQQIAEQVA KGTKEAKEKG LAQVAHLSKE EVAAVNEAKR QGRYTTDDGY 200 

IFSPTDIIDD LGDAYLVPHG NHYHYIPKKD LSPSELAAAQ AYWSQKQGRG 250 

ARPSDYRPTP APGRRKAPIP DVTPNPGQGH QPDNGGYHPA PPRPNDASQN 300 

KHQROEFKGK TFKELLDQLH RLDLKYRHVE EOGLIFEPTQ VIKSNAFGYV 350 

VPHGDHYHII PRSQLSPLEM EIADRYLAGQ TEDNDSGSEH SKPSDKEVTH 400 

TFLGHRIKAY GKGLDGKPYD TSDAYVFSKE SIHSVDKSGV TAKHGDHFHY 450 

IGFGELEQYE LDEVANWVKA KGQADELAAA LDQEQGKEKP LFDTKKVSRK 500 

VTKDGKVGYM MPKDGKDYFY ARDQLDLTQI AFAEQELMLK DKKHYRYDIV 550 

DTGIEPRLAV DVSSLPMHAG NATYDTGSSF VIPHIDHIHV VPYSWtTRDQ 600 

IATVKYVMQH PEVRPDVWSK POHEESGSVI PNVTPLDKRA GMPKWQIIHS 650 

AEEVQKALAE GRFATPDGYI FDPRDVIAKE TFVWKDGSFS IPRADGSSI*R 700 

TINKSDLSQA EWQQAQELLA KKNTGDATDT DKPKEKQQAD KSNENQQPSE 750 

ASKEEKESDD FIDSLPDYGIi DRATLEDHIN QLAQKANIDP KYLIFQPEGV 800 

QFYNKNGELV TYDIKTLQQI NPP (SEQ ID NO : 81) 823 
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GTGAAGAAAA CATATGGTTA TATCGGCTCA GTTGCTGCCA TTTTACTAGC TACTCATATT 60 

GQAAGTTACC AACTTGGTAA GCATCATATG GGTCTAGCAA CAAAGGACAA TCAGATTGCC 120 

TATATTGATG ATAGCAAAGG TAAGGCAAAA GCCCCTAAAA CAAACAAAAC GATGGATCAA 180 

ATCAGTGCTG AAGAAGGCAT CTCTGCTGAA CAGATCGTAG TCAAAATTAC TGACCAAGGT 240 

TATGTGACCT CACACGGTGA CCATTATCAT TTTTACAATG GGAAAGTTCC TTATGATGCG 300 

ATTATTAGTG AAGAGTTGTT GATGACGGAT CCTAATTACC ATTTTAAACA ATCAGACGTT 360 

ATCAATGAAA TCTTAGACGG TTACGTTATT AAAGTCAATG GCAACTATTA TGTTTACCTC 420 

AAGCCAGGTA GTAAGCGCAA AAACATTCGA ACCAAACAAC AAATTGCTGA GCAAGTAGCC 480 

AAAGGAACTA AAGAAGCTAA AGAAAAAGGT TTAGCTCAAG TGGCCCATCT CAGTAAAGAA 540 

GAAGTTGCGG CAGTCAATGA AGCAAAAAGA CAAGGACGCT ATACTACAGA CGATGGCTAT 600 

ATTTTTAGTC CGACAGATAT CATTGATGAT TTAGGAGACG CTTATTTAOT ACCTCATGGT 660 

AATCACTATC ATTATATTCC TAAAAAAGAT TTGTCTCCAA GTGAGCTAGC TGCTGCACAA 720 

GCTTACTGGA GTCAAAAACA AGGTCGAGGT GCTAGACCGT CTGATTACCG CCCGACACCA 780 

GCCCCAGGTC GTAGGAAAGC TCCAATTCCT GATGTGACGC CTAACCCTGG ACAAGGTCAT 840 

CAGCCAGATA ACGGTGGCTA TCATCCAGCG CCTCCTAGGC CAAATGATGC GTCACAAAAC 900 

AAACACCAAA GAGATGAGTT TAAAGGAAAA ACCTTTAAGG AACTTTTAGA TCAACTACAC 960 

CGTCTTGATT TGAAATACCG TCATGTGGAA GAAGATGGGT TGATTTTTGA ACCGACTCAA 1020 

GTGATCAAAT CAAACGCTTT TGGGTATGTG GTGCCTCATG GAGATCATTA TCATATTATC 1080 

CCAAGAAGTC AGTTATCACC TCTTGAMTG GAATTAGCAG ATCGATACTT AGCCGGTCAA 1140 

ACTGAGGACA ATGATTCAGG TTCAGATCAC TCAAAACCAT CAGATAAAGA AGTGACACAT 1200 

ACCTTTCTTG GTCATCGCAT CAAAGCTTAC GGAAAAGGCT TAGATGGTAA ACCATATGAT 1260 

ACGAGTGATG CTTATGTTTT TAGTAAAGAA TCCATTCATT CAGTGGATAA ATCAGGAGTT 1320 

ACAGCTAAAC ACGGAGATCA TTTCCACTAT ATAGGATTTG GAGAACTTGA ACAATATGAG 1380 

TTGGATGAGG TCGCTAACTG GGTGAAAGCA AAAGGTCAAG CTGATGAGCT TGCTGCTGCT 1440 

TTGGATCAGG AACAAGGCAA AGAAAAACCA CTCTTTGACA CTAAAAAAGT GAGTCGCAAA -1500 

GTAACAAAAG ATGGTAAAGT GGGCTATATT ATGCCAAAAG ATGGCAAGGA CTATTTCTAT 1S60 

GCTCGTGATC AACTTGATTT GACTCAGATT GCCTTTGCCG AACAAGAACT AATGCTTAAA 1620 

GATAAGAACC ATTACCGTTA TGACATTGTT GACACAGGTA TTGAGCCACG ACTTGCTGTA 1680 

GATGTGTCAA GTCTGCCGAT GCATGCTGGT AATGCTACTT ACGATACTGG AAGTTCGTTT 1740 

GTTATCCCTC ATATTGATCA TATCCATGTC GTTCCGTATT CATGGTTGAC GCGCGATCAG 1800 

ATTGCAACAA TCAAGTATGT GATGCAACAC CCCGAAGTTC GTCCAGATGT ATGGTCTAAG 1860 

CCAGGGCATG AAGAGTCAGG TTCGGTCATT CCAAATGTTA CGCCTCTTGA TAAACGTGCT 1920 

GGTATGCCAA ATTGGCAMT CATCCATTCT GCTGAAGAAG TTCAAAAAGC CCTAGCAGAA 1980 

GGTCGTTTTG CAACACCAGA GGGCTATATT TTCGATCCAC GAGATGTTTT GGCCAAAGAA 2040 

ACTTTTGTAT GGAAAGATGG CTCCmAGC ATCCCAAGAG CAGATGGCAG TTCATTGAGA 2100 

ACCATTAATA AATCTGATCT ATCCCAAGCT GAGTGGCAAC AAGCTCAAGA GTTATTGGCA 2160 

AAGAAAAACG CTGGTGATGC TACTGATACG GATAAACCCA AAGAAAAGCA ACAGGCAGAT 2220 

AAGAGCAATG AAAACCAACA GCCAAGTGAA GCCAGTAAAG AAGAAGAAAA AGAATCAGAT 2280 
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VKKTYGYIGS VAAILLATHI GSYQLGKHHM GLATKDNQIA YIDDSKGKAK 50 

APKTNKTMDQ ISAEEGISAE QIWKITDQG YVTSHGDHYH FYNGKVPYDA 100 

IISEELLMTD PNYHFKQSDV INEILOGYVI KVNGNYYVYL KPGSKRKNIR i 50 

TKQQIAEQVA KGTKEAKEKG LAQVAHLSKE EVAAVNEAKR QGRYTTDDGY 200 

IFSPTDIIDD LGDAYLVPHG NHYHYIPKKD LSPSELAAAQ AYWSQKQGRG 250 

ARPSOYRPTP APGRRKAPIP DVTPNPGQGH QPDNGGYHPA PPRPNDASQN 300 

KHQRDEFKGK TFKELLDQLH RLDLKYRHVE EDGLIFEPTQ VIKSNAFGYV 350 

VPHGDHYHI I PRSQLSPLEM ELADRYLAGQ TEDNDSGSDH SKPSDKEVTH 400 

TFLGHRIKAY GKGUX3KPYD TSDAYVFSKE SIHSVDKSGV TAKHGDHFHY 450 

IGFGELEQYE LDEVAWWVKA KGQADELAAA LDQEQGKEKP LFDTKKVSRK 500 

VTKDGKVGYI MPKDGKDYFY ARDQLDLTQI AFAEQELMLK DKNHYRYDIV 550 

DTGIEPRLAV DVSSLPMHAG NATYDTGSSF VIPHIDHIHV VPYSWLTRDQ 600 

IATIKYVMQH PEVRPDVWSK PGHEESGSVI PNVTPLDKRA GMPNWQIIHS 650 

AEEVQKAIAE GRFATPDGYI FDPRDVLAKE TFVWKDGSFS IPRADGSSLR 700 

TINKSOLSQA EWQQAQELLA KKNAGDATDT DKPKEKQQAD KSNENQQPSE 750 

ASKEEEKESD DFIDSLPDYG LORATLEDHI NQLAQKANID PKYLIFQPEG 800 

VQFYNKNGEL VTYDIKTLQQ INPP (SEQ ID NO : 83) B24 
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Boi I Observations where certain claim* ware found unsearchable (Continuation of item 1 of first sheet) 



This International Starch Report has not bean established in respect of certain claims under Article 17(2)(a) for the following reasons; 



Although claims 26-32 are directed to a method of treatment of the 
human/animal body, the search has been carried out and based on the alleged 
effects of the compound/composition. 



because they relate to parts of the Internationa] Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



— because they are dependent claims and are not dratted in accordance with the second and third sentences of Rule 6.4(a), 
Box II Observations where unity of Invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows; 

see additional sheet 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
' — I searchable claims. 



2. [_J As ail searchable claims could be searched without effort justifying an adoftonai fee, this Authority did not invite payment 
of any additional fee. 



3. IT] As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
covers only those claims for which fees were paid, specifically claims Hot. : 




because they relate to subject matter not required to be searched by this Authority, namely: 






(l-32)partially f subjects 1 and 2; 



4, rj No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
— restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



International Application No. PCT/CA 99/01218 



FURTHER INF RMATI NC NTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1, Claims: (1-32) -partially 

An isolated polynucleotide encoding a polypeptide having at 
least 79%/90% identity to a second polypeptide having a 
sequence chosen from the BVH-3 family, respectively SEQ ID 
NOs.: 2,8,10,16,55-59,64-66, and 78 or fragments, analogs or 
derivatives thereof; said isolated polynucleotide encoding a 
polypeptide capable of generating antibodies having binding 
specificity for said polypeptide sequences; a vector 
comprising said polynucleotides; a host cell transfected 
with said vector; a process for producing said polypeptides; 
a chimeric polypeptide comprising two or more polypeptides 
chosen from said SEQ ID Nos. 2,8,10,16,55-59,64-66, and 78; 
a vaccine composition comprising said polypeptides; a method 
for therapeutic or prophylactic treatment of meningitis, 
otis media, bacteremia or pneumonia infection in an 
individual; use of a vaccine composition for said treatment; 



2. Claims: (l-32)-partially 

Idem as subject 1, but limited to the BVH-11 family 
respectively SEQ ID NOs. 4,14,58,60-63,67-75,77 and 79; 



3. Claims: (l-32)-partially 

Idem as subject 1, but limited to the BVH-28 family 
respectively SEQ ID NO. 6; 



4. Claims: (l-32)-partialy 

Idem as subject 1, but limited to the BVH-71 family 
respectively SEQ ID NOs. 81 and 83; 
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